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ABSTRACT 



A representation of the Uni vac AN/UYK-20 computer sys- 
tem has oeen emulated on the Burroughs D Interpreter-Based 
System. The entire AN/UYK-20 instruction reoertoire has 
been emulated with the exception of the ‘math pac ' option 
(float inq point arithmetic ana cordic functions)* clock and 
interrupt codes* and inout/output operation codes. Modular 
design with extensive documentation has been imolemented 
throughout orogram develocment allowing for ease of modifi- 
cation and further extensions to the existing emulation. 
Emulation and the hardware architecture of the AiM/UYK-20 and 
the Burroughs D-machine* are discussed in conjunction with 
the An/UYK- 20 emulation itself. Methods of testing and 
deDuqging, samole test proarams and recommendations for con- 
tinued design modifications to the emulation are oresented. 
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INTRODUCTION 



A. STATEMENT OF THE PROBLEM 

The Navy has been challenaed with maintaining the 
newest# most efficient tactical data systems consistent with 
the continually increasing demands and requirements of the 
real-time environment. There is an extensive conversion 
effort reauired to change from existing systems to newer 
more sophisticated technology such as the AN/UYK-20. 
Inherent in uograding to a new system is the complex 
software redesign and modification process which is often 
hindered by the absence of the new computer system. 

Unfortunately# the demands of a military installation 
reauire software generation prior to implementation of an 
upgraded computer system. One solution to this problem is 
to utilize for software development an intermediate computer 
system wnich has the capability of emulating the anticipated 
target machine. This provides a vehicle for software 
design# development# and test i nq prior to transitioning to 
the new system. 

Currently# the Naval Postgraduate School (NPS) Computer 
Science Department maintains a Burroughs Interpreter-Based 
System known as the Burroughs D-machine. The D-machine is 
capaole of oeinq m i c r op roar ammed to emulate any of a miriad 
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of target machines. It effectively enables students to 
create their own comouter knowing only the machine instruc- 
tion repertoire for the control unit in the target machine. 

The problem presented was to develop a feasible working 
model of the AN/UYK-20 on the m i c r op rog r ammab 1 e Burroughs 
O-machine. The project orovided an opportunity to obtain 
practical exoerience with contemporary hardware and to mani- 
pulate writeaole control stores to imitate a Navy tactical 
c ompu ter. 

B. APPLICATIONS OF THE AN/UYK-20 

The Uni vac AN/UYK-20 minicomputer is a general purpose 
militarized digital computer adaptable to numerous tactical 
apol ications. The AN/UYK-20 has been successfully utilized 
in many tine-critical ? real-time systems including fire con- 
trol radar/ communication controllers/ sianal processing 
analyzers for sonar and beacon signals/ and numerous weapons 
control systems. A suoseguent chanter will be devoted to 
the technical asoects and internal design of the AN/UYK-20. 

Because of its size/ ruggedness/ and comouting capabili- 
ties/ the AN/UYK-20 has been designated the Navy’s standard 
tactical minicomputer (lbl. It was selected for emulation 
in order to provide a feasible platform for software 
development to those military installations either contem- 
olating or in the process of receiving an AN/UYK-20. 
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A workable emulation would allow military apol i cat i ons 
such as data reduction* navigation* telemetry* sensor pro- 
cessing* ranae tracking and logistics to have software pack- 
ages developed* tested* and modified prior to arrival of the 
AN/UYK-20. Furthermore* it would permit personnel to become 
familiar with the machine by providing advanced training* 
thereby easing the transition phase to the new system. 

C. PROJECT DESIGN OBJECTIVES 

Several design techniques were used throughout the 
development of this project: 1) modularity* 2 ) structured 
programming* and 3) extensive documentation. These design 
features will aid the interested reader as well as simplify 
any future extensions or modifications to the existing emu- 
lation. 

Modular design was utilized by creating independent pro- 
gram segments which were individually developed* debugged* 
and tested. These modules or subroutines provided a strong 
foundation which were readily modified throughout the entire 
programming effort. Conceptually* the emulation was divided 
into relatively small entities which were further reduced to 
program segments rarely exceeding one page in length. 

Structured programming was demonstrated by utilizing a 
limited number of control flow structures and maintaining a 
common logical desicn throughout the entire emulation. 



Comprehensible code held precedence 



over 



extremely efficient 



code 



In addition to mooularity and structured programming, 
the entire programming endeavor was suDpl emented with exten- 
sive commenting to provide the necessary sel f-documentat ion 
to promote and facilitate program translation, modification, 
and fusing witn the other i ndeoendent modules. These con- 
cepts promoted the extensive team effort required to achieve 
the research goals. In addition, it will provide ease of 
program maintenance and modification in the future. 
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II. emulation 



a. historical BACKGROUND 



The term microprogramming was first utilized in an arti- 
cle by Professor M. V. Rilkes of the Cambridge University 
Mathematical Laooratory in 1951 [241. His paper concen- 
trated on a control section within the computer which, when 
P rog r amm a t i c a 1 1 y controlled, performed reqi ster-to-regi ster 
data transfers sequentially and in parallel for the execu- 
tion of a single machine instruction. A sequence of opera- 
tions (microinstructions) required for execution of a 
machine instruction is considered a m i c r op roq r am . 



T r ad i t i ora 1 1 y , tne computer has been composed of essen- 
tially five components: the arithmetic/logic unit, the con- 
trol unit, memory or storage, input, and output (Figure 
1 1 — 1 ) . Tne control section sets the prooer conditions for 
the opening and closinq of required gates in the logic net- 
work. Historically, the control section has been hardware 
consisting of a series of decoders and flip-flops along with 
their associated circuitry. Therefore* every machine 
instruction hao a fixed interpretation which was hardwired 
within the control unit. 



In 1 Q 5 7 , .'Hike’s 
slightly modified. It 



definition of m i c roP rog r amm i ng was 
was defined as a technique of design- 
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i n g the control circuits of an electronic digital comouter 
to interpret and execute a given set of machine ooerations 
as an equivalent set of micro-operations [151. 

The hardwired control section can be modified by inter- 
changing ROM modules or other hardware components* by 
replacing the control section with a programmable (dynami- 
cally writeable) control store which in itself is a separate 
word-organized memory (Figure 11-21 or by combining both 
aoproaches. A orogrammable control store allows rapid 
changes in the machine's instruction repertoire while main- 
taining maximum design flexibility. The resulting computer 
system is m i c top rog r ammab 1 e and capable of storing a series 
of changeable machine oersona 1 i t i es . 

The comouter control store can thus be modified to allow 
trie execution of machine 1 anquaae programs intended for a 
varietv of machine architectures. This process can be com- 
pared to reolacing hardware components found in convention- 
ally designed computer systems. The primary advantage of 
m i c roorogrammed logic is the capability to perform various 
control seauences without hardware modifications. The pro- 
cess through which the hardware components of one machine 
(host) are made to imitate the specific hardware charac- 
teristics of another machine (target) is known as emulation. 




DATA > 

CONTROL * 

Figure 1 1 - 1 [11] 
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OUTPUT 



> ARITH/LOGIC 



MICROMEMORY 



DATA > 

CONTROL > 



Figure I I -2 
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Emulation allows the computer scientist to create vari- 
ous machine architectures from a single microprogrammable 
host. The complete set of m i c rop r og rams (firmware) and the 
necessary hardware, as well as the reguired software, added 
to one computer system enabling it to execute programs 
designed for another system is known as an emulator. 

B. microprogramming 

Computer manufacturers have made available numerous 
microoroarammaDle machines which permit the user to tailor 
his instruction repertoire to meet the neeos of his Particu- 
lar application. Some examples of microprogrammable com- 
puter systems are the Burroughs D-machine# the Nanodata 
QM - t , the Vari an 73, the Standard Logic CASH-8, or the 
He w 1 e t t -Pac x a rd 2100. These m i c roo rogrammab 1 e systems pro- 
vide the benefits of flexibility, lower system costs and a 
systematic approach to system design if utilized effec- 
tively. 

dhen a manufacturer desiqns a dynamically writeable con- 
trol store, the amount of parallelism to be allowed must be 
determined. Parallelism is defined to be the simultaneous' 
control of numerous hardware resources. There are basically 
three forms of control: vertical, horizontal, and residual. 
In vertical m i c roo r og r amm i no , each instruction controls a 
single operation with oroaram flow being sequential, unless 
the instruction was a conditional or unconditional brancn. 
By contrast, a horizontally m i c r oo rog r ammed machine is 
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proqrammed via instructions which simultaneously control 
multiple resources including condition testing and microin- 
struction sequencing. 

Horizontal microinstructions usually are not encoded 
which means each bit controls one machine resource or opera- 
tion. They usually have a wider word than vertical instruc- 
tions and consequently consume more memory. Vertical 
instructions are usually encoded with one or two levels. 
Encoding means the value of a control field in the microin- 
struction is a binary code soecifyinq which resource or 
operation is to be performed. The horizontal microinstruc- 
tions have the potential of being much more efficient 
resource managers and conseouently are more difficult to 
optimally design than their vertical counterparts, 

Combining the attr inutes of horizontal and vertical 
microprogramming results in residual control. This method 
saves memory by using vertical microinstructions while 
simultaneously controlling multiple parallel resources via 
set up r eg i sters. 

Microinstruction implementation severely effects the 
speed of microprogram execution. In serial implementation^ 
one microinstruction is fetched and fully executed prior to 
fetching the next instruction. This technique offers the 
advantage of logical simplicity while suffering from lack of 
efficiency since it consumes the maxim u m amount of time. 



'Parallel' implementation permits fetching of 



the next 



instruction before termination of the Previous instruction. 
The obvious advantage is execution soeed which is of utmost 
importance when emulating another machine (Figure 1 1 - 3 ) (23. 

Another significant microprogramming characteristic is 
the number of chases used in the execution of each microin- 
struction. A monophase system means there are no subdivi- 
sions of the basic clock pulse and consequently each 
microinstruction is controlled by the transmission of the 
leading edge of a clock cycle. In a polyphase implementa- 
tion scheme# the basic clock cycle is subdivided into minor 
phases which are independently generated via hardware. 
Altnough polyphase operations are more complex and require 
complicated control# they do permit faster resource manipu- 
lation when thev are efficiently cooed by allowing multiple 
operations to be performed during the same phase(s). 

The microorogrammability of a given computer and the 
capabilities of its associated mi crooroqrammi nq language are 
directly effected by the the presence or absence of each of 
the alternative microprogramming characteristics described 
above. The microprogramming language spectrum ranges from 
the lowest level or microl anouage through the assembly 
languages to the high level procedural languages. 



IF 



Microinstruction 
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i + 1 
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a conditional branch 
microinstruction 



» 

(c) Time 

(a) Serial fetch and execute. 

(b) Parallel fetch and execute. 

(c) Combined serial-parallel where next address 
depends on conditions in present cycle. 



Figure 1 1 - 3 [?) 
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The problems of m i c pop roa r amm i nq can be significantly 
reduced if suitable software supoort exists and is readily 
available. This support is usually in the form of simula- 
tors and debuggers. Typically* a simulator provides an 
alternative to assembly level coding by permitting the user 
to code in a higher level language and yet achieve the same 
results at the expense of some added memory and execution 
time. Debuggers are extremely useful in the developmental 
stages of microprogramming especially for new and experimen- 
tal system design. Debuggers permit dynamic access to the 
machine status and register contents at the instant they are 
employed? i.e. a trace feature. Some debuggers offer the 
opportunity of assembling in-line. This option can drasti- 
cally reduce required debugging time. 

The primary application of microprogramming is to imple- 
ment the necessary control structure required for the 
analysis and execution of machine level instructions by 
means of programmed control stores rather than hardwired 
logic. Therefore? a dynamically microprogrammable computer 
can provide a software development system which can be a 
cost-effective approach to experimentation with potential 

candidates for replacement computer systems or the design of 

r 

completely new systems to fit the needs of unique applica- 
tions. 
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C. THE GOALS OF EMULATION 

A wel 1 -desiqned emulation can provide an opportunity to 
experiment and create software for new computer systems 
before the actual hardware is available. The utilization of 
an emulator can almost eliminate reorogramming/ consequently 
smoothing the system transition period. In addition/ emula- 
tion has provided a workable model of new systems under con- 
sideration for procurement/ providing a much more detailed 
cost-benefit analysis of system conversion. 

Furthermore/ it is often economically sound to emulate a 
second generation computer with a third generation system. 
This provides growth to a contemporary system while fulfil- 
ling the reauirements of the past in a cost-effective 
manner. However/ this can be a disadvantage if the program- 
ming staff uses the emulation as a link to the old system 
and consequently fails to take advantage of the attributes 
of the new system. 

D. EMULATION VERSUS SIMULATION 

To accomplish the emulation objectives/ certain design 
features must be incoroorated into an emulation. Naturally/ 
execution time and allocated memory are ’the two foremost 
considerations. Traditionally/ the concept of mimicking 
another computer has been accomplished by either a simulator 
or an emulator/ two concepts often confused with one 
anotner. 
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A simulator is a series of high level language (HLL) or 
assembly language statements which individually do not 
behave like the target machine instructions. The host 
machine executes its own native instructions in order to 
imitate the target machine operations. Consequently/ simu- 
lation is a rather slow technique because it requires an 
intermediate translation. In addition, simulation of cer- 
tain instructions such as bit manipulation and shifting 
operations can require an enormous amount of intermediate 
code generation demanding a significantly larger memory 
a 1 location. 

An emulator is a microprogram that is executed on the 
host machine, performing machine instructions of the target 
machine. Since an emulator accepts the binary object code 
of the target machine and directly executes these instruc- 
tions, it can be extremely efficient in terms of time and 
soace requirements. The execution time of an emulation is 
dependent uoon many factors: clock rates of the two 
machines, f reauency of memory references, high soeed shift- 
ing compatibility, reauired register mapping between tarqet 
and host machines, bit manipulation capability of the two 
machines, condition code selection and testing, flexible 
data oatn selection capability, interrupt similarities, 
inDut/outout comoa t i p i 1 i t y , and microprogramming efficiency. 
If the haraware features between target and host machines 
are extremely compatible and hiqhly efficient microprogram- 
ming has been employed, an emulation performance ratio (host 
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to target) of nearly one to one can be attained. This emu- 
lation performance ratio (EPR) has been demonstrated by the 
emulation of the SKC-2070 on the Nanodata QM-1 computer. It 
is possible to achieve an EPR better than one to one under 
ideal situations, when the host machine has a much faster 
internal operation execution rate [11. 

Several distinct advantages can be realized using emula- 
tion as compared to simulation. The execution speed is sig- 
nificantly better by at least an order of magnitude. The 
target machine reoresentation in firmware is closer to the 
actual hardware design and total access to the lowest 
machine level is achievable. Perhaps most noteworthy, emu- 
lation provides the opportunity to rapidly create test beds 
for numerous machine architectures and provide a basis for 
new system development. 

E. EMULATION TECHNIQUES 

Traditionally, there have been three approaches for emu- 
lating machine instructions: 1) hardware or firmware assis- 
tance to a software simulation as demonstrated by the IBM 
3 6 0 / o 5 emulation of the IBM 7090, 2 ) independent host system 
hardware or firmware which provides for complete execution 
of the target machine’s instruction repertoire of which the 
Burroughs D-mach i ne emulation of the AN/UYK-20 is an exam- 
ple, ana 3) an auxiliary processor which is operated in con- 
junction with the host machine to execute target machine 
instructions [ 1 *U . 
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Software-controlled emulation is usually characterized 
bv categorizing the target machine instructions into three 
distinct classes: easily emulated instructions* complex 
instructions not readily emulated* and those instructions 
not deemed necessary for the desired a poli cation. Instruc- 
tion usage is significant in t h f s classification process. 
Each class of instructions becomes a candidate for direct 
hardware or firmware imolementation. The first emulated 
function in this aDorcach is usually the fetch and analysis 
ooeration. After the instruction is analyzed* the appropri- 
ate oocoae subfunction can be executed. 



An alternative emulation technique is the firmware- 
controlled method. This approach is identified by having 
system control reside comoletely in firmware or hardware 
during the emulation process. All instructions are executed 
on the host machine as if they were indigenous to the target 
machine. This method is much more efficient than the 
software-controlled techni gue? however* it is more expensive 
and the cost differential is directly related to the 
required performance level. Performance is dependent upon 
the number of required data paths* arithmetic units* and 
other additional logic circuitry which must supplement the 
host machine architecture. 

Upon entering the emulation mode in a firmware- 
controlled emulator, the machine Performs like the target 
machine until encountering an exit situation. There exists 
three exit modes: 1) priority interrupt* 2) not implemented 
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instruction, and 3) deliberate exit because of a debugging 
rout ine. 

The third emulation technique consists of utilizing aux- 
iliary hardware electronically attached to the host computer 
for the sole purpose of executing target machine instruc- 
tions. In effect, a target machine is composed of host 
machine hardware with the necessary additional components 
required to create an effective emulator. 

F. EMULATION HARDWARE 

The development of writeable control stores and 
microprogramming techniques have significantly influenced 
computer design. This section will describe some of the 



available dynamically 


microproqrammabl e 


h a rdwa r e 


(Figure 


1 1 -a ) . 










The 


He w 1 ett-Pacxard 


2100 is a general 


purpose 


minicom- 


pu t e r . 


It has a unique control store divi 


ded into 


two seg- 


men t s . 


One section is 


RO^ and the other 


section 


i s user 



programmable. The machine is vertically microprogrammed 
using a standard 80 instruction machine language repertoire. 



A debugger and 


assembler assist 


the 


user 


in microprogram 


de ve 1 ooment 12 J . 










The Standard 


Logic CASH-8 is a 


h i gh 


speed 


digital con- 


troller with a 


seoa rate cont ro 1 


store 


. It 


consists of 16 



general purpose registers and an accumulator. The CASH-8 is 
vertically microprogrammed but does not support any language 
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above m i c ro 1 anquaqe [21 . 

The Van' an 73 is a general purpose minicomputer that has 
a 150 instruction set. The horizontal microinstruction con- 
sists of 64 bits with 25 fields/ some of which indicate 
register transfers# ALU operations# shifting# control store 
addressing, condition testing# I/O control and memory opera- 
tions. The Vari an 73 contains both a ROM control store and 
a wr i teabl e control store loadable from main memory. A 
microprogram assembler and interactive simulator are avail- 
able [21 . 

Tne Nanodata Q M - 1 is unique in that it contains both a 
control store and a nanostore which are Doth loaded under 
user program control. The lfl-bit vertical microinstructions 
are stored in the control store# fetched and then inter- 
preted under nanoprogram control. A horizontal nanoinstruc- 
tion is 3 b 0 bits which is subdivided into five 72-oit vec- 
tors. Assemblers for both m i c rop r oq r am s and nanoProqrams 
are available [21 . 

The previously described machines represent a small sam- 
ple of the available m i c rop rogr ammab 1 e computer architec- 
tures. The availability and flexibility of these computer 
systems has stimulated demand for these devices. Conse- 
quently# hardware manufacturers have been compelled to pro- 
duce wri teabl e control store equipment to satiate the needs 
of t h e computer market. 
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CONTROL STORE MICROPROGRAMMED USER MICROPROGRAMMABLE 
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RCA Spectra 70 Model 45 



None 



Preparation of Provision of 

User Microprograms Translator or 

Simulator 



SUPPORT AVAILABLE TO USERS 



Note: Relative microprogrammability is the distance from the origin 

to the machine point in two space. 



Figure I I - ^ 121 
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AN/UYK-20 ARCHITECTURE 



Constructing an efficient emulation requires a precise 
understandina of the architecture and performance charac- 
teristics of the machine being emulated. An emulation must 
attempt to match the target machine's features and maintain 
its flexibility of hardware design as closely as possible. 
Although it is not reaui red# an operational demons t r a t i on of 
the emulated machine can solve many emulation questions. 

In emulating the AN/UYK-20/ architecture and perfor- 
mance criteria were derived from technical publications/ 
since an actual machine was unavailable. When inconsisten- 
cies appeared in the documentation/ specific questions were 
posed to a UNIVAC field engineer/ who often tested proqrams 
on the AN/UYK-20 to resolve inconsistencies. Documentation 
coupled with an expert consultant provided sufficient infor- 
mation for emulating the AN/UYK-20 successfully. 

The intent of this chapter is to outline those features 
of the AN/UYK-20 significant to the emulation. A detailed 
hardware description can be found in Refs. 20/ 22 , 28. 



A. HARDWARE DESIGN 



The AN/UYK-20 was designed for the Navy to fulfill the 
requirements for small or medium size general purpose data 
processing in shipboard# mobile shelter, or other military 
environments. Soerry Univac incorporated minicomputer tech- 
nology in constructing the AN/UYK-20, including MSI circui- 
try aesign, m i c rop rog rammed control, memory modularity, and 
asyncnronous or synchronous i nput /output channels. 

The AN/UYK-20 had to be extremely flexible in its appli- 
cations, offering a wide range of configuration possibili- 
ties which were derivatives of the basic design. Modular- 
ity, a concept highly desirable in a military environment, 
was achieved by offerinq options that could be easily added 
using printed circuit cards and/or memory modules. 

The AN/UYK-20 can accomodate up to eight 8K, sixteen-bit 
word boards of magnetic core storage with an access time of 
750 nanoseconds. The central processor is controlled by a 
programmaole micromemorv which can be expanded by an addi- 
tional 512 words. The microoroqram controller is programmed 
at the factory, but the additional micromemory option is 
user defined. Both sections of micromemory are programmed 
using fusible links, and once oroqrammed they are completely 
static (Figure Iil-l). 

A memory interface is responsible for the transfer of 
data to memory from the central processor (CP) and 
input/outout controller (IOC). Both the IOC and the CP are 
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capable of accessing all of memory (65/536 words maximum). 
The addition of direct memory access (DMA) provides a second 
memory interface and an additional access port which is con- 
nected to each of the two 32K memory segments. 

The i nput/outout controller permits the central proces- 
sor to communicate with the external devices without 
interfering with prop ram execution. The IOC has a maximum 
of 16 parallel or serial channels. Parallel nata transfer 
takes place asynchronously using 8-bit/ 16-bit/ or 32-bit 
transfers. Serial interfaces are either synchronous or 
asynchronous/ with word-to-serial or seri al -to-word conver- 
sions occurring in the IUC. The IOC and CP compete for 
memory access through the memory interface with priority 
given to the IOC in the event of a simultaneous request. 
The IOC is permanently assigned several memory addresses for 
command word and interrupt word storage. 

The addressable high-speed registers available in the 
AN/U YK -20 include the oroqram address register ( P- reg i s t e r ) / 
two 16-bit status registers (SRI and SR2)/ a real-time clock 
register (32- bits)/ a monitor clock register (16-bits)/ and 
a set of sixteen 16-cit general registers. An additional 
stack of 16 general registers is an available hardware 
oot ion. 

The sixteen general registers were included to enhance 
the speed and performance of the AN/UYK-20, allowing most 
programs to use a great proportion of reqi ster-to-regi ster 
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instructions. These general registers can be used as accu- 
mulators for arithmetic# shift# or logical functions# as 
index registers# or as temporary storage locations. The 
second set of general registers can be readily employed via 
a status bit. This status bit designates which general 
register stack is to oe utilized. The duplicate set of gen- 
eral registers yields dividends in a multi-task or heavy- 
interrupt processing environment. This additional register 
set can be used to provide high-soeed temporary storage# 
thus avoiding slower main memory storage of working vari- 
ables. 

The two 16-bit status registers and the program address 
register represent the machine status of the AN/UYK-20. 
when these registers are collectively referenced# they are 
called the program status word ( 4 8 - b i t PSrt). The P-register 
indicates the next instruction to be executed. This 
instruction may be a lo-bit single-word instruction or a 
32-oit double-word instruction. Program control can be 
modified by using an instruction which manipulates the con- 
tents of the P-register. 

Status register l contains bit information concerning 
condition code settings# overflow# and carry bits# interrupt 
codes# and numerous other machine indications (Figure 
III-2) . 
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AN/UYK-20 FUNCTIONAL ARCHITECTURE 




Figure III-l [ 22 ] 



12 



STATUS REGISTER 1 




SR BITS 
8 and 9 

00 

01 

10 

11 



CONDITION CODES 
ARITHMETIC 
0 

>0 (POS) 

Not Used 
<0 (NEG) 



COMPARE 
;R a '» (R m )of <Y) 
(R a )X R m )°r<Y) 
Not Used 

(f V <(R m ,or (Y) 



Figure I I I - 2 f23] 
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Status register 2 holds control bits for direct or 
indirect addressing, and holds interrupt codes- Interrupt 
processing routines set bits in the interrupt code field 
corresponding to the IOC interrupt (Figure III-3). 



STATUS REGISTER 2 



15 14 


13 12 


ii 10I9 a | 7 6 5 4 3 2 1 


1 0 1 






1 INTERRUPT CODE 




1 


INDIRECT CONTROL 8ITS FOR R 10 






INDIRECT CONTROL BITS FOR R, 2 




INDIRECT CONTROL BITS FOR R 14 




INDIRECT CONTROL BITS FOR R, g 



INDIRECT CONTROL BITS MEANING 

00 Normal Addressing. 

01 Normal Addressing. 

10 Indirect Addressing w/o indexing; 

IW1 at V * y. 

11 Indirect Addressing with indexing; 

IW1 at Y = y + (Rm) . 



Fiaure 1 1 1 - 3 (231 

The real-time clock and monitor clock registers provide 
program-control I ed interrupt capability which is useful for 
timing and synchronizing program segments with real-time 
events. The real-time clock C R T C ) is a 32-bit register used 
as count-up storage while the monitor clock (i^ON) is a 
16 -pit count- down register. A one kHz internal oscillator 
controls the counting speed of both registers. An optional 
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external clock operating at a frequency up to 50 kHz is also 
a va i 1 ab 1 e . 

Interruot Drocessina in the AN/UYK-20 is conducted using 
a priority level scheme which classifies interrupts into 
three priority levels (classes). Interrupts within the same 
class are assigned a priority ranking and a code which iden- 
tifies which processing routine to execute. During inter- 
rupt handling, all interrupts of the same level or lower 
level are locked out until the CP is completed processing 
the current interrupt. Higher priority interrupts can over- 
ride the lockout and cause the CP to honor them first, hold- 
ing the lower level interrupts in abeyance until higher 
level interruot Drocessina is completed. The highest prior- 
ity interrupts are hardware malfunctions, followed by 
software interrupts, and at the lowest level » IOC inter- 
rupt s . 

Permanent locations in memory co r r esoond i na to each 
interrupt class hola the PSrt and RTC when an interrupt is 
being honored. Likewise* other permanent memory addresses 
assigned to each interrupt class hold the appropriate inter- 
rupt routine entrance location to be loaded into the PSl\. 

Memory adoressi na is accomplished using t> 4 page address 
registers which separate memory into 1024-word pages. Abso- 
lute addresses are formed by isolating the upper six bits of 
the relative address to find the cage address register 
number, and then concatenating the lower six bits of the 
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page address register contents with the lower ten bits of 
the relative address (Fiaure III-4). Any operation that 
stores into memory sets the most significant bit of the page 
address register used in generating the address. 

Some additional hardware features of the AN/UYK-20 are 
those functions available on the maintenance panel of the 
machine itself. These include a breakpoint feature which 
allows an ooerator to insert from the panel an address which 
causes the AN/UYK-20 to stop execution when the selected 
address is referenced. Other available toggles allow halt- 
ing execution p rog r amma t i c a 1 1 y using Key l or Key 2 on the 
maintenance panel. These additional hardware features are 
useful debugging tools. 
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MEMORY ADDRESS GENERATION 
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SELECTS ONE OF !OOg 
PAGE AOORESS REGISTERS 




page modification INDICATOR 



Figure 1 1 1 -a [22] 
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INSTRUCTION FORMATS AND REPERTOIRE 



1. Repertoire of Instructions 

The AN/UYK-20 instruction set is composed of nearly 
26 0 separate instructions designed to be both versatile and 
comprehensive. Both single-word (16-bit) and double-word 
(32-bit) instructions are available* Some of these instruc- 
tions are specifically desiqned to meet the requirements of 
a real-time environment. A few sample instructions include: 

a. Local jumo - used to facilitate looos# sav- 
ing several steps in program execution. 

b. Reverse register - used in a communication 
environment when data is received in one 
sequence but must be transmitted in reverse 
sequence. 

c. Set o i t # clear bit* and test bit - used to 
test inoividual bits in registers saving 
considerable execution time in programs that 
communicate via flags and status words. 

Additional flexibility is provided when the 'math 
pac 1 hardware option is included in the AN/UYK-20 configura- 
tion. Some 33 additional opcodes are addeo to the instruc- 
tion set in order to increase the computational capabilities 
of t ne machine. An instruction for square root# double pre- 
cision multioly/oiviae instructions# as well as hardware 
trigonometric and hyperoolic functions which utilize coordi- 
nate conversion algorithms (cordic) are included. 
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Single-word instructions are generally employed when 
manipulating operands in high-speed registers. Double-word 
instructions are used in operations involving direct or 
indirect addressing with or without indexing/ or supplying 
16-bit constants to programs. Typical instruction speeds 
for a single-word versus a double-word instruction are: 





SINGLE- 


WORD 


DOUBLE-WORD 


ADD 


.75 - 1 .5 


usee 


1.5 - 2.25 


usee 


LOAD 


.75 - 1.5 


usee 


1.5 - 2.25 


usee 


MULTIPLY 


3.8 - 4.0 


usee 


4. 4 - 4.6 


usee 


DIVIDE 


6.8 - 7.0 


usee 


7.4 - 7.5 


usee 



Nearly all instructions affect condition Pits in 
status register 1. The AN/UYK-20 sets these bits as a 
result of two types of operations. Most instructions that 
do not involve compare logic are categorized as arithmetic 
instructions. when the result of the arithmetic operation 
is determined and is about to be stored in memory or a 
register/ a conoi t i on code is set indicating whether the 
result is positive/ negative/ or zero. Compare instructions 
set the condition pits based on a greater than/ less than/ 
or equal to comparison of two registers or a register and 
the contents of a memory address. 

Two other bits in SRI are set as a result of compu- 
tational or shifting instructions. The overflow designator 
is set whenever an arithmetic or shift operation requires 



more 



Pits than are provided for in a register (16 bits) 



The carry designator is set when an arithmetic operator gen- 
erates a carry beyona the most significant bit of the regis- 
ter. 



The AN/UYK-20 allows five different types of operand 
formats: single-length, byte, literal# ootional floating 
point# and double- length. Single-length operands are 16-bit 
values with the sian bit assumed to be in the most signifi- 
cant bit. In arithmetic calculations# the single-length 
word is assumed to be a two's complement integer. Byte 
operands are considered 8-bit unsigned integers# and can be 
the most significant or least significant half-word of a 
memory location. Literal operands are 4-bit unsigned 
integers denoted by the ' m 1 field of a literal type instruc- 
tion. Floating point operands are formed using two adjacent 
registers or memory locations with fields containing the 
sign of the fraction# the characteristic# and 24 bits of the 
f r ac t i on . 



Ooud 1 e- 1 engt h operands are concatenated from two 
adjacent registers or two adjacent memory addresses. The 
most significant half of the double-1 ength operand is con- 
tained in the low-oraer register or memory address# and the 
least significant half in the next seauent i al register or 
address. The sign bit for both words resides in the hiqh- 
order bit position of the most significant half-word and 
when used in an arithmetic calculation, the double-length 
operand is treated as a two's complement integer. Instruc- 
tions involving doub 1 e- 1 engt h operands require the low-order 



register or memory address to be even/ since the adjacent 
cell adaress is formed by * 0 R * - i n g a 1 in the least signifi- 
cant bit of the address. 

2 , Instruction Format 

The AN/UYK-20 has five different instruction word 
formats/ designated by two-letter mnemonic codes. These 
codes are: R R (Register-Reqister)/ RL (Register-Literal)/ 
RI (Register-Immediate)/ RK (Register-Constant)/ and RX 
(Register-Inaex) . Each of these formats are desianated in 
the instruction wore by a combination of the opcode field 
and the subfunction code. Registers are identified in the 
'a' and ' m ' fields of the instruction/ and are referred to 
by the notation Ra and Pm. The 'y' field is treated as an 
address or arithmetic constant/ deoendina on the instruction 
(Figure 1 1 1 - 5 ) . 

The format RR single-word instructions perform 
operations involving the registers SDecifieo by the 'a' and 
1 m 1 fields. No memory references are made to access 
operands/ causing considerable savings in execution time. 
The 'a' or ' m ' field may be used to index special operations 
on registers. 

Format RL instructions use a lb-bit instruction 
word/ using one or two general registers. The 'a* designa- 
tor selects the general reaister Pa or register pair R a / Ra 
+ 1. The ' m ' designator contains the 4-bit literal value 
which will be used in the instruction. 



INSTRUCTION FORMATS 



Il5|l4|l 3|l2|l 1 |l0 


9 j 8 


7 [ 6 | 5 | 4 


3 | 2 i 1 i 0 


Op* 

1 


0 


a 


m 



a selects R a ; m selects R m . 



!l5|l4|l3tl2!ll(l0|9| 8 


7 1 6 I S J 4 


l3|2|-iT0| 


| Op* 


0 


a 


m 



a selects R a ; m contains 4-bit constants. 



RM 

Local 

Jump 



15|14|1 3|12[1 1 llO 


9 | 8 


'J|«|S|4|3| 2 |,|0 


Op* 


1 


d 



d is signed number of instructions to jump, relative to current position. 
(+ is forward; — is backward) 



RI-2 

Indexed 

Memory 



RK 



15|14|13|12|11|10 


9 [ 8 


“(61514 


3 | 2 | 1 ! 0 


Op* 


1 


a 


m 



a selects R a . m selects R m . R m selects memory address. 



(15|14|13|1 2fl 1 llOl 9 ( 8i 7 ( 6 | 5 1 4 1 


J 3 1 2 | 1 | 0 


°°- i 2 1 


a | 


m 




a selects R a , m 4 - 0 selects R m , m * 0 selects 0 
Operand = y + R m or 0. 



RX 



I5|l4|l3|l2!n|l0l9! 8 j 7 | 6| 5| 4 | 


I 3 I 2 1 1 ! 0 J 


Op* I 3 » a 


m 



15il4|l3ll2|n[lQ|9 j 8 1 7 
V 



a selects R a , m 0 selects R m , m = 0 selects 0. 
y ♦ R m or 0 selects memory address. 



•Op is operation code 



Figure III-5 C231 



R1 format single-word instructions are separated 
into two cataaories. RI Type 1 instructions are local jump 
instructions where the P-register is increased or decreased 
by the ' d ' field in the instruction. The ' d ' field 
reoresents the two's complement deviation value and allows 
the P-reqister to be altered a maximum of +177 octal or -200 
octal. RI Type 2 instructions involve operations that use 
the general registers Ra and R m , and a memory address speci- 
fied by the contents of Rrp. 

The format RK instructions contain 32 bits of infor- 
mation. The uooer lb-bits contain the operation code and 
the designator fields. The 'a' field selects the qeneral 
register R a, and the ’m' field denotes how the next word, 
containing ' y ' , is to be used. If ' m ' equals zero# then the 
effective operand address Y , equals ' y ' . If ' m ' does not 
equal zero, then V is formed by adding the contents of Rm to 
' y ' . 



Format RX are double-word instructions similar to 
the RK instruction tnat use direct and indirect addressinq 
techniaues determined by the ' m * field. If the ' m ' field 
equals zero, then direct addressing without indexing is 
employed, with the effective ooerand address formed from the 
' y * fielc itself. If the ' m ' field eouals 1 through 7 , 11, 
13/ 15, or 17 (octal), then direct addressing with indexing 
is employed. The effective operand address is formed by 
adding t K e contents of Rm to 'y*. An 'm* field value of 10, 
12, 1 a , or 16 (octal) indicates indirection is to be 



employed/ and the indirect address control fields in status 
register 2 contain information which is used to generate the 
effective operand address or a' pair of indirect words, When 
the control fields eaual 0 or 1/ then direct addressing with 
indexing results. If the control field equals 2 , then the 
contents of the first indirect word (I'M) is located at ' y 1 . 
Finally/ if the control field eauals 3 / then I W 1 is located 
at ' y * indexed by the contents of R m . Indirect word format 
is shown in Figure 1 1 1 - 6 . Cascaded indirection is possible 
provided that the indirect words are properly encoded. 

Byte addressing is accomplished using RX format 
instructions. A byte identifier (B) is used to determine 
which half-word (8-bit) is to be referenced. If B equals 0/ 
then tne most significant half-word in address Y is the 
operand byte. If 3 eauals 1/ then the least significant 
byte at Y is the operand. The least significant bit in the 
indexing register is used as the byte identifier/ and the 
remaininq 15 bits are used as the indexing value for finding 
Y. Indirect byte ooerand addressinq formulae are included 
in Figure III-o. The followinq formulae apply for direct 
byte operand addressing: 

m = 0 / Y = y / and B = 0 

m=l-7/ 11/ 13/ 15/ or 17 octal 



Y=y + (Rm)/2 and B=LSB of (Rm) 



The ©receding section has described the fundamentals 
of the AN/UYK-20 instruction formats, References 21 and 23 
should be consulted for further information concerning 
instruction formats and decoding. 



INDIRECT WORD FORMATS 

IW ^ 

I W 2 



OCTAL J 
VALUE 


ADDRESS DETERMINATION 


0 


Y * IW2; if byte mod*, upp*r byt* is used . 


1 


Y 3 IW2 ♦ (Rx); if byt* mod*. LS8 of R* d*t*fmtn<* byt*. * 


2 


Y * IW2 (Rm); if byt* mod*, LS8 of Rm det* mines byt*. * 


3 


Y * IW2 * (Rm+1); if byt* mod*. LSB of Rnwl d*t*fmm*s byt*. * 



15 14 13 12 11 10 9 8 7 6 5 A 3 2 1 0 



NOT 


1 


J 


NOT USED 


X 


USED 











IW 2 



I * 0 DIRECT ADDRESSING. OPERAND AT AODRESS Y 

I * 1, CASCADED INDIRECT ADORESSING. NEW INDIRECT 
WORD 1 AT ADDRESS Y 

• To d*t*fmir>* th* op«r»nd addf*u when in byt* mod*, th* co«m*i»'j jf R , or R_, or R_ _ .ate 

* m j fn ♦ » 

right shifted 1 bit position and iff 0 filled in the left most position 



Figure lll-b 1231 
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IV. BURROUGHS D-MACHINE 



A. HARDWARE DESCRIPTION 

The microprogramming facility at the Naval Postgraduate 
School is composed of a Burrouahs Interpreter-Based system. 
This system possesses the characteristic of being dynami- 
cally microProgrammable and is designed using a simple 
ouilding block structure. A typical system is made up of a 
numoer of interpreters (processors)/ main memory modules/ 
and input/output devices/ along with a switch interlock dev- 
ice (SwI) controlling data flow on the data bus connecting 
the interpreters to main memory and peripheral devices. The 
heart of the system is the interpreter/ also referred to as 
the D-mac h i ne . 

A D-machine possesses five functional modules: the 
logic unit (LU)/ the control unit (CU)/ the memory unit 
(MU)/ nanomemory (NM), and microprogram memory (MPM). The 
system presently installed in the Computer Science Depart- 
ment combines nanomemory and microprogram memory into one 
functional unit. The architecture of the D-machine is 
designed around 8-bit word slices. word lengths from 8 bits 
to b 4 bits are permissible usina the same functional unit 



(Figure I V - 1 ) 
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The D-machi ne used for this thesis has been configured 
as a 3 <? - b i t word processor. Reference 17 provides a 
thorough and concise description of the architecture of an 
interpreter-based microproaramming system. Reference 6 
details the SDecifics of D-machi ne microprogramming which 
must be thoroughly uncerstood by the programmer. 

1 . Logic Unit 

The D-machi ne's logic unit performs shifting/ arith- 
metic/ and logic functions and contains scratch pad regis- 
ters and data interfaces for the switch interlock. All 
adder operations are performed using two's complement arith- 
metic/ ang shifting is accomplished in a matrix of gates 
called the barrel switch. 

The scratch oad registers A 1 , A 2 , and A3 are identi- 
cal in function. They act as temporary storage registers 
within tne D-machi ne and serve as primary inputs to the 
adder. The control unit determines which register will be 
an input to the adaer. Any of the A-registers can receive 
the output from the barrel switch. 



The B register functions as the primary external 



i n p u t 


interface 


f rom 


the switch 


interlock. 


It acts 


as the 


second 


input to 


the adder and can 


rece i ve 


certain 


d i rect 


inputs 


from the 


adder’s 


arithmetic 


ope r ations. The b 


reg i s- 


ter can 


be 1 oaded from 


any of the 


f o 1 lowing 


sources : 





a. The barrel switch output. 

b. The adder output. 

c. External aata from the switch interlock (or con- 
trol panel switches). 

d. The memory information register (MIR). 

e. The complements of four bit or eight bit car- 
ries. 

f. The barrel switch ORed with the adder, external 
data from the switch interlock, or MIR. 

The output of the B register is filtered prior to 
gating into the adder. The 'filter* consists of 

true/complement selection gates which are divided into three 
sections: the most significant bit, the least significant 

bit, and all the remaining central bits. The output of this 
filter is the independent result of these sections and may 
be either all zeroes, all ones, or the true contents or 
one's complement of the contents of the respective bits of 
the B register. 

The memory information register is the interface to 
the switch interlock. MIR functions as a buffer to main 
memory or to a peripheral device. It is an output destina- 
tion of the barrel switch and its output can be sent to the 
B-register or the switch interlock. 

The adder in the loqic unit performs all arithmetic 
functions and can be categorized as a version of a carry 
look-ahead aader. The control unit can gate various combi- 
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nations of A , B # and Z inouts to the adder. An 'A 1 input is 
defined as an input from one of the three scratch pad regis- 
ters. A '6' input is the output of the B register’s 
true/complement filter dates. A 'Z' input is an external 
input to the logic unit and can originate from one of the 
following sources: 

a. The output of the counter in the memory control 

unit ( M C U ) wnich is gated to the most signifi- 

cant eight bits of the adder with the remaining 
bits zeroed. 

b. The output of the literal register in the MCU 
which is gated to the least significant, eight 
bits of the adder with the remaining bits 
z e roed . 

c. An optional incut which is gated into the middle 
bytes of the adder with the most and least sig- 
nificant bvtes zeroed. 

d. The output of the alternate microprogram count 
register (A'^PCR) in the MCU which is gated into 
the least sionificant 12 bits of the adder (13 
bits for 6 K micromemory machines) with all other 
bits being zeroed. 

e. All zeroes. 

Two inputs/ selected from the A, B# or Z sources# 
are always aated to the adder. These inputs are referred to 
as X-select and Y-select. An X-select input may be either 
an A input or a Z input. If it is not specified# it is 



SO 



assumed to be zero. A valid Y-select has eitner a B/ Z or 1 
as its input. Some Z inputs# however/ are valid only as Y- 
select inputs. Any combination of valid X-selects and Y - 
selects are permissible addends/ with an ootion of adding a 
one to the least significant bit. For subtraction opera- 
tions/ the value to be subtracted must be a Y-select; the 
Y-select input is subsequently two's complemented and gated 
to the adder. All binary Boolean operations between two 
adder inputs are allowed/ and dynamic condition bits for 
overflow (A 0 V ) / all bits true (ABT)/ and most/least signifi- 
cant bit test ( W ST/LST) are available to the control unit 
for testing. Bit testing is a valuable feature for decoding 
instruction words. 

The barrel switch is a matrix of gates that accepts 
parallel data from the adder and shifts the data any number 
of places to the left or riqnt with zero fill. It also can 
right shift the wore in an end-around fashion. The output 
of the barrel switch ray be directed to any of the following 
destinations simultaneously: 

a. The A registers. 

b. The o register. 

c. The remorv information register. 

d. The least significant lb bits to the MCU reais- 
t e r s . 

e. Tne least significant three to six bits to the 
control unit shift amount reaister (bit length 
depending on the word size of machine). 
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The Cont ro 1 Unit 



The control unit has five major sections: the shift 

amount register (SAR), the condition register (CO NO), part 
of the control register (CR), the decoder for microprogram 
memory content/ and clock control. This module of the D- 
machine manaaes the functions of the processor/ and is 
directly responsible for logic unit operation. 

SAR and its associated logic control shifting opera- 
tions by loading shift amounts into SAR and generating the 
required controls indicated by the current microinstruction 
for the barrel switch. In addition/ the SAR’s logic gen- 
erates the 'word length complement' of the SAR contents 
where the complement is defined to be that amount which 
would restore the bits of a word to their original position 
after an end-around shift of N followed by an end-around 
shift of the 'complement' of N, For example/ if an end- 
around right shift of 20 was required in a 32-bit D-machi ne ; 
another end-around shift of the complement of 20 (12) would 
be required to restore the contents to its original value. 

The condition register has four major functions: 

a. It stores 12 resettable bits which are used as 
error indicators/ interrupts/ status/ and 
lockout indicators. 

b. It selects one of 16 condition bits for perform- 
ing conditional operations. These 16 bits are 
composed of the 12 condition bits of the 



condition reaister d1 us the 4 dynamic conditions 
generated by the IU adder during the present 
clock time. 

c. It decodes bits from the memory for resetting# 
setting# or reauest i nq the setting of designated 
oits of the condition register. 

d. It resolves oriority between interpreters in the 
setting of global condition bits (GC)# thereby 
providing a method of controlling inter- 
interpreter lockout. 

The control reaister stores the control bits of the 
5 6 - b i t microinstruction that are not being used in the first 
phase of the execution cycle. The control register is sub- 
divided into sections which are used by the memory control 
unit# the logic unit# and the control unit during the execu- 
tion phase of a microinstruction. For a description of tim- 
ing and Phases# see section C of this chapter. 

3. Memory Control Unit 

The memory control unit provides the basic address- 
ing interface between the D-machine and both main memory 
(S-memory) and microprogram memory (M-memory). One MCU can 
address 6^K words (256K bytes) of main memory# and if the 
D-machine is configured with a second MCU# a maximum of 1?8K 
words can be addressed. 

The memory control unit has three major sections: 
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a. The microprogram address section controls the 
addressing of microprogram memory and the 
sequencing of microinstructions. It contains 
the microprogram count register ( M P C R ) / the 
alternate microprogram count register (AMPCR), 
the incrementer, the microprogram address con- 
trols register/ and their associated logic. For 
standard 4 K M -memories/ the MPCR and AMPCR are 
1 £ bits long. For 8 K M-memorieS/ MPCR and AMPCR 
are 13 bits in length (the D-machines installed 
at the Postaraduate School employ 8K M- 
memories). AMPCR is a Y-select input to the 
logic unit adder . 

b. The memory/device address section contains the 
8-bit memory address register (MAR)/ two lb-bit 
base registers (BR1 and 6 R 2 ) , output selection 
gates/ and associated control logic. When form- 

i no a memory address/ the lower eight bits of a 

/ 

base register and MAR are concatenated. The 
concatenated 2^-bit contents of BR1/ BR2 and MAR 
( B M A R ) is a valid Y -select incut to the logic 
unit adder. 

c. The Z register section contains two registers 
which are Z inputs to the logic unit adder. The 
liberal register (LIT) is an 8 -bit register into 
which constants are loaded. An 8-bit counter 
(CTR) is used in conjunction with a counter 
overflow condition bit to control iterative 
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also contains 



looping. The Z register section 
selection gates for the loadable counter and its 
associated logic. 

4. Microprogram memory (M-memory) 

A D-machine may have a dual or sinqle microprogram 
memory scheme. As indicated earlier, the D-mach i nes used in 
this emulation project had a single microprogram memory, 
consolidating the microprogram memory and nanomemory into 
one 5 b - b i t programmable store memory, often referred to as 
M -memory. Microprograms, consisting of 56-bit microinstruc- 
tions, are dynamically changeable oy the user, thus distin- 
guishing the D-machine as an extremely flexible computing 
device. 

The seouencing of microprogram instructions is con- 
trolled by a condition bit procedure which determines the 
successor command to be executed. M -memory provides data to 
the condition testing logic which then determines which con- 
dition is to be tested. The output of the condition testing 
logic is a true/false signal that is gatec to the successor 
selection logic. This loaic then selects between the three 
true and three false successor bits also provided by the M - 
memory word. Tne three selected bits provide eight possible 
successor combinations: 
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a . 
b. 
c . 
d. 
e . 
f . 
< 3 . 
h . 



WAIT Receat the current instruction. 

STEP Step to the next instruction. 

SKIP Skip the next instruction. 

JUMP Jump to another M - memory address. 

PETN Return from a microprogram subroutine. 

CALL Call a microprogram subroutine. 

SAVE Step and save the instruction address. 

EXEC Execute one instruction out of sequence. 



The Particular successor command chosen controls 
gates wnich select the appropriate ^-address from MPCR or 
AMPCR ana provides incrementina logic for generating the 
next ^-memory address. Except for the EXEC command/ the 
N’PCR is loaded wi tn this M-memory address. 



B. NPS MICROPROGRAMMING FACILITY 
l . Physical Description 

The Computer Science Department of the Naval Post- 
graduate School Possesses a Burroughs Interpreter-Based Sys- 
tem consisting of three interpreters (processors) also known 
as D-machines# a 6^K 32-bit word/ main memory module/ a card 
reader/ a dual cartricge disk drive/ a line printer/ and a 
Datamedia 2500 CRT functioning as a supervisor's console 
(Figure IV-2). All input and output is performed through a 
single D-macnine processor/ hardware conf i aured with device 
dependent ports (DDP) for peripherals and the external 
clock. 
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CLOCK CONTROL 




NAVAL POSTGRADUATE SCHOOL MICROPROGRAMMING FACILITY 
BURROUGHS INTERPRETER-BASED SYSTEM 



Figure I V-2 
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After initial liqht-off and bootstrap/ the system is 
configured into two Burroughs 6700 - LIFO ALGOL Stack- 
machines/ each addressing 3 2 K of memory and each communicat- 
ing with the input output processor (IOP) in a pseudo- 
multiprocessing environment. Software/ written in ALGOL/ is 
provided which runs on the 8 - b 7 0 0 system. A resident moni- 
tor control program and disk file manager control the 
maintenance of system files and the execution of jobs in a 
batch environment. Both D-machines compete for system jobs 
input from the card reader or CRT. Other software available 
includes an ALGOL compiler (a derivative of ALGOL 60)/ a 
microprogram translator called TPANSLANG, a line editor/ and 
a simulation program for microprograms. TRAN SLANG provides 
the medium by which microprograms are written. User 
microprograms loaded into a D-machine change its identity 
and destroy the Stack-machine previously loaded. 

2. Input /Ou t du t Interface 

Pivotal to the operation of, the Burroughs system is 
the inout/outout interface. Only one processor/ the IOP/ 
communicates with peripherals/ and the other D-macbineS/ 
configured as Stack-machines/ must compete for its services. 
The IOP communicates asynchronously/ using a conventional 
’handshake* metnod. Since all interpreters have access to 
the main memory module* a communications link has been esta- 
blished using the upper two 32-bit words of main memory. If 
a otack-macnine wishes to communicate with the IOP/ it 
places a message into address 65/535 known as the 'mailbox’. 
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and issues an interruct ( I NT ) to the IOP. The IOP periodi- 
cally tests INT, and if set » will retrieve the contents of 
the mailbox (and mailbox - 1 if required) and perform the 
aesired ooerat i on. The other Stack-machine is locked out 
from interrogating the IOP until it has completed processing 
the reauest. Normally, the operation requires transferring 
a buffer to some output device. When the IOP has completed 
honoring the request, it places a completion code in the 
mailbox ana sets INT for the Stack-machine requesting the 
I/O. This interpreter must halt execution and check mailbox 
to see if the 1/0 was performed successfully, completing the 
hanoshakinq process. This Protocol permits both Stack- 
machines to perform input/outcut independently of each 
other, provided both maintain strict memory boundaries. 

Another function of the IOP is interfacing character 
code formats between the peripherals and the other two D - 
machines. When the machines are configured as Stack- 
machines, characters are passed to the IOP in 6-bit BCL 
(Burrouahs Common Language). The IOP must convert this 
character set to ASCII for output to the line printer and 
CRT. A similar translation must be made for input data con- 
verting from either EBCDIC or ASCII depending on whether the 
input source is the card reader or the CRT. 

3. 4/ emory Interface 

Since all three D-machines must share the 64K of 
main memory, a priority scheme was developed to resolve 
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memory reference conflicts. The main memory module is actu- 
ally a s i nq 1 e-po r t ed ? 32-oit word* core memory which can be 
made to apDear mul t ioorted usina a switch interlock unit 
(Srtl) developed by Burroughs. The switch interlock controls 
the main data bus of the system, and resolves conflicts 
using a priority scheme. The D-machine with the highest 
priority is tne 1 0 P ? with the priority of the other two 
machines being relative to their physical proximity to the 
IOP. Once a memory reference has oeen made? a 0-machine 
may continue execution without waiting for a completion sig- 
nal from the switch interlock. Although this technique of 
memory referencina minimizes unnecessary delay? it restricts 
the program from chancing the read or write addresses or the 
content of MIR (write only) prior to a completion signal. 

C. MICROINSTRUCTION TIDING 

The Burroughs D-machine initiates a microinstruction 
once every clock cycle. The 0-mac h i nes utilized for the 
AN/UYk-BO emulation operated from a one mHz internal clock? 
which produced a clock pulse once every microsecond. A D - 
machine designed with an eight mHz clock? emitter-coupled 
logic (ECL)r and a faster memory cycle time? however? could 
execute eight times faster. This implies that advances in 
circuit technology can permit emulations to achieve improved 
speed and performance with no change in the m i c rop rograms . 

Every microinstruction is executed using one or more 
sequential time periods? called phase 1 » phase 2 ? and phase 
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3. A phase is a constant interval of time equivalent to one 
clock duration measured from <he trailing edge of each suc- 
cessive clock pulse. Some microinstructions only reaui re 
phase 1 to complete execution. Some reaui re phase 1 and 
phase 3 # and still others reaui re phases 1 # 2 # and 3. A new 
microinstruction is initiated at each clock cycle# allowing 
for overlapping of microinstruction execution in phase 1 and 
phase 3 . 



Microinstructions consist of two types, 
microinstruction# events can take place in al 



In a type 1 
t h ree phases : 



Phase 1 : 



Phase 2 : 



Phase 3 : 



condition testing, (conditional) external 
operations or (conditional) logic opera- 
tion initiation after completion of a 
prior logic operation, and successor ^ - 
memory address control. 

a holdina chase for phase 3 logic opera- 
tion controls. 

the completion chase for loqic unit opera- 
tions and destination register gating 
specified by loaic operation. 



During the optional phase 2 Period# a type 1 microin- 
struction execution completion is held in abeyance while a 
subseauent type 2 instruction is executed. A type 2 
microinstruction requires only phase 1 to complete execu- 
tion# and involves literal assignments to three registers: 
LIT# SAR, and A l 4PCR. Phase 2 may also be initiated if the 
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next secuent ial type 1 instruction does not execute its 
ditional logic operation and therefore can complete its 
cut ion in phase 1 (Figure I V - 3 ) . Appendix 0 of Ref, 6 
complete discussion of microinstruction timing. 



con- 
ex e- 
has a 
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PHASE 1 (FETCH) 



PHASE 3 (EXEC) 




M - MPM ACCESS TIME 

COND TEST AND SUCC DET - CONDITION TEST AND SUCCESSOR DETERMINATION 
BSW - BARREL SWITCH 

DEST - BARREL SWITCH OUTPUT DESTINATIONS; I.E., REGISTERS (B,CTR,ETC. } 
AND THEIR INPUT LOGIC 

C.R. - COMMAND REGISTER AND ASSOCIATED LOGIC 

AIS ADDER INPUT SELECTION FROM COMMAND REGISTER 



Timing Analysis, Type I Instructions 



Figure 1 V ” 3 [17] 
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TRANSLANG 



Microprogramming on the D-machine is accomplished using 
a microtranslator/assembler called TRANSLANG. TRANSLANG 
allows the Drogrammer to write microinstructions mnemon i - 
cally without concentrating on the bit patterns that compose 
the microinstructions themselves. TRANSLANG is written in 
ALGOL, the language of the B6700 Stack-machine. Nearly the 
entire languaae of TRAN SLANG is composed of reserved words 
recognized by the ALGOL program. tacn reserved word has a 
special meaning whicn causes the translator to construct 
particular microinstructions. A TRANSLANG instruction is 
eaui valent to one microinstruction consisting of the set of 
parallel D-machine functions performed during the clock 
phases. TRAN SLANG is a free form language and instructions 
may oe written in almost any order. Multiple instructions 
may apoear on a line, seoarated bv a period TRAN SLANG 
constructs include iterative mechanisms, input/output, 
assignment functions, control transfers, and Boolean, and 
computational operations. In addition, TRANSLANG permits 
label definitions and symbolic references for program con- 
trol flow. Reference b is the programming manual for 
TRANSLANG and contains the complete syntax for the language. 
Appendix A of Ref. 10 documents additions to the TRANSLANG 
instruction recert iore. 

A nicrcorogrammer may construct complicated microin- 
structions tnat perform many different tasks, some interact- 
ing closely with D-machine clock timing. Microinstruction 
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gating to several cevices permits a single TRAN SLANG 



instruction to accomplish some or all of the following 
actions: 

a. test a condition. 

b. set/reset a condition. 

c. initiate an external operation. 

d. add. 

e. shift the result of an add. 

f. store the result into several registers. 

q. increment a counter. 

h. complement a shift amount. 

i. determine the successor microinstruction. 

By judiciously composing his microprogram, a programmer may 
minimize execution time by taking advantage of microinstruc- 
tion phase overlap anc using highly parallel microcode. 

The TRAN SLANG assembler constructs an object program 
consisting of non-rel ocatabl e 56-bit microinstructions. 
TRAN SLANG maintains a cross reference table that resolves 
laoel references during assembly. The object code created 
is stored on a disk and may be loaded into the micro memory 
of a D-m. achine using a special control word recoani zed by 
the operating system. Once loaded/ the D-machine assumes 
the control structure dictated by the users microprogram. 
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V. AN/UYK-20 EMULATOR 



A. EMULATION DESIGN 

1. Functional Components 

The architecture of the AN/UYK-20 emulator micropro- 
gram was developed usina general auidelines provided by 
references and previous emulation experience on Burroughs 
D-machines l 1 0 J . The first decision incorporated in the 
emulator desian was to integrate the entire emulation within 
one D-machine. Since the D-machines had the capacity to 
handle nearly S K of microinstructions* no microprogramming 
caoacity limitations were envisioned. Tne design objectives 
of a modularized* wel 1 -documented* structured microprogram 
could also be real i zed. 

.\ith the emulator entirely contained in one D - 
macnine* several secondary benefits also existed. First* 
the emulator was more immune to hardware oroblems. If one 
D-machine was malfunctioning* the emulator could still be 
run on the alternate D-machine. Second* it was possible to 
have two AIM/UYK-20 emulators resident in the system at one 
time. Not only would two emulators speed uo testing of the 
inoividual emulation* but they would also permit their even- 
tual use in a system configured for AN/UYK-20 mul t i orocess- 
ina. The third anc final benefit of a totally integrated 



emulator was recognized during the design of the 
i nout/output controller (IOC). Since the AN/UYK-20’s IOC 
was capable of independent processing, an emulation of its 
IOC would not be possible within the same 0-machine. An 
emulation of the IOC could be accomplished in a second D - 
machine, which woulc behave as an independent channel for 
the O-machine configured as the AN/UYK-20 processor. 
Although this emulation does not attempt to emulate the 
AN/UYh-^0 IOC, the fact that a second 0-machine exists makes 
its implementation a realistic extension to the project. 



The 

lowing the 
the 1 owes t 
f o 1 1 owing 
the 1 oade r 

which s i an i 



emulator program organization was created f o 1 - 
basic Guidelines of Ref. 17. The loader occupied 
section of 'M-Tiemory with the emulator microcode 
sequentially. Microprogram control passed from 
to the emulator via the execution of a *G' card 
fied execution commencement. 



The emulator microprogram was organized into three 
modules positioned sucn that forward address referencing 
would ne minimized in the TRAILS LANG assembler to save time 
and soace. The utilities section was in the lowest portion 
of the emulator, since its routines were referenced fre- 
auently by the succeeding sections. Individual subroutines 
within the utility section were organized alphabetically. 



The instruction and memory fetch routines comprised 
the next module. These routines incorporated all fetch 
options availaole in tne AN/UYK-20 emulator. 
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The opcode routines occupied the last section of the 
emulator. Tne opcodes were arranged numerically with 
further indexinq determined by the remaining fields of the 
individual instruction. Figure V-l shows the M-memory map- 
ping of the AN/UYK-20 emulator layout. Appendix F contains 
the emulation program listing. 

2 . Main Memory Organization 

Of primary importance in the emulation design was 
the logical organization of tne Burroughs system's main 
memory. Since tne ANVUYK-20 was a lo-bit word machine* it 
was decided t n a t only the lower half-word of a Burroughs 
32-oit word would be used oy the emulator. This design res- 
triction permitted tne addressing structure of the AN/UYK-20 
to oe directly projected on the D-machine. A one-to-one 
correspondence between the memory address of the AN/UYK-20 
and the Burroughs system was also achieved. 

Since tne number of hiah-speed registers available 
in the D-machine was small* a IK portion of main memory was 
reserved for the A N / U Y K - 2 0 addressable register set* the 
page address registers* the temoorary storage space* and the 
emulation ouffers. Ihe mappinq of the AN/UYK-20 hiqh-speed 
registers to main memory greatly simplified the micropro- 
gramming requirements of the emulation* but added consider- 
able execution time overhead. Memory access times for the 
mapped registers were much slower than the actual transfer 
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rates of the AN/lJYK-20 registers. Figure V-2 shows the com* 
d) ete main memory mapDing of the emulation reserved storage 
area. 



3 . Emulation Program Status word 

Although the emulation duplicated all the AN/UYK-20 
registers/ the registers which comprised the program status 
word (PS'/O possessed uniaue character! st i cs . Status regis- 
ter 1 was combined with the program address register to form 
a single iP-bit PS/*. The emulator's PS to always co-existed 
in tne Al register of the D-machine and in main memory dur- 
ing execution of AN/UYK-^O programs. Tne fields of SRI were 
modified to include certain emulator toggles/ along with the 
normal condition bits (Figure V - 3 ) . The condition bits for 
DMA and non-destructive read only ( f\i D R 0 ) mode were removed 
from tne SRI since they were hardware features of the 
AN/UYK-dO that would not oe emulated. 

Status reaister 2 , the remaining lb bits of the AN- 
UYK-20's P S / was not resident in any D-machine registers 
during emulation execution for two reasons: the emulation 
could not afford the luxury of ^8 bits of reserved register 
space/ and SR2 was less f reauent 1 y referenced than SRI and 
the P-register. Consequently/ SR2 had to be read from its 
reserved location in main memory. The contents of the upper 
lb nits of SRP's memorv location also contained additional 
emulation toggles which were used by the debugger package 
(Figure V-3 ) . 
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MAIN MEMORY MAPPINGS 



DECIMAL octal 

ADDRESS ADDRESS 

******************************* 



0 


- 


15 


0 


- 


1 7 


1 6 


- 


31 


20 


- 


37 






32 






40 






33 






41 






34 






4? 






35 






43 






3b 






44 






37 






45 


38 


- 


43 


4b 


- 


53 


44 


- 


49 


54 


- 


61 


SO 


- 


53 


62 


- 


65 


54 


- 


hr 


66 


- 


167 


120 


- 


121 


170 


- 


171 


1?? 


- 


141 


1 7? 


- 


215 


1 42 


- 


152 


216 


- 


230 


1 S 3 


- 


1 85 


231 


- 


271 


186 


- 


205 


272 


- 


315 


2 0 6 


- 


229 


316 


- 


345 


230 


- 


Ibl 


3 4 6 


- 


1 377 


766 


- 


1023 


1 4 0 0 


. 


1777 



USE 

**************************** 
GENERAL REGISTER STACK 1 
GENERAL REGISTER STACK 2 
PROGRAM STATUS WORD 
BREAKPOINT REGISTER 
STATUS REGISTER 2 (SR?) 
NEXT LOAD ADDRESS 
CLOCKTIME 
REAL TIME CLOCK 
WORKSPACE (TEMP STORAGE) 
STACK (TFMP STORAGE STACK) 
I/O COMMAND WORDS (IOCW) 
UNUSED 

HEX ADDRESS FOR INPUT 
INPUT CARD BUFFER 
UNUSED 

OUTPUT PRIM BUFFER 
CRT BUFFER 
ERRORLIST 
UNUSED 

PAGE ADDRESS REGISTERS 



Fiaure V - 2 
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AN/OYK-20 EMULATION 
PROGRAM STATUS WORDS (PSW) 

ACTIVE PSW: RESIDES IN LU REGISTER A1 (MEMORY ADDRESS 32) 




INACTIVE PSW: RESIDES IN MEMORY ADDRESS 33 



[ STATUS REGISTER 2 ] 







MINI ITT"! 1 N 


I 


| 


“P 


~r 


TTTTTTT1 












INTE 


inte: 

RPRE' 


INTERRUPT CODE ( 
RPRETED IF m»10 
TED IF m»12 


3 

INTERPRETED IF m-14 


9 

INTERPRETED IF m-16 




3 

NOT USED 




TRACE OUTPUT (CRT=1, PRT=0 ) * 



TRACE FLAG* 



* FIELDS ADDED FOR USE BY THE EMULATOR 



F ioure V - 3 
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J . 0 - Machine Registers 



with only a limited number o f high-speed registers 
available in tne 0-machine/ the emulation design had to 
include a xel 1 -develooed d1 an for their usage. Since only 
seven registers in the D-machine were lb bits or longer/ 
they were used exclusively for manipulating the AN/UYK-20 
addresses/ instructions/ ana data. The register environment 
had to be consistent throughout every execution cycle to 
allow utility routines to be cal 1 eo in the same manner by 
all oocode subroutines. The primary goal of the emulation 
was to use as few memory references as oossibl e to conserve 
execution time and memory space. Judicious use of the 
existing registers was a necessity. 

Several factors had to be considered before select- 
ing a register for a particular “^unction. The most impor- 
tant consideration was its flexibility within the 0-machine. 
Tne B register/ for examole/ was used as a general purpose 
register/ since its contents could be oated through a mas<- 
inq filter orior to being utilized. A secona factor was the 
type of operand it could contain. Ooubl e-woro operands 
( 3 2 - o i t ) ccula only be stored in the 32-bit logic unit 
registers while lb-oit operands could also be stored in the 
memory control unit registers. 

The' A 1 register contained the emulation’s 32-bit 
orogram status * o r a (PS '/* ) at the commencement of the emula- 
tor program. It was not affected by individual instruction 
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microcode, exceot wnen incrementing the PAR, or when a par- 
ticular instruction modified the PSh. Emulator toggles 
resident in SRI could not be altered by an AN/UYK-20 
instruction because their settings were independent of pro- 
gram execu t ion. 

The A3 register held the instruction word for the 
duration of its execution cycle. Each field of the instruc- 
tion could be decoded and interpreted from the A3 register 
without having to retrieve it from memory. Once the 
instruction had been completely decoded, A3 was mane avail- 
able as a scratch cad register. 

The A2 and 6 registers were used as scratch pad 
registers during each opcode execution cycle. In general, 
their contents were volatile, except wh.en they were specifi- 
cally documented in the the proaram listing. The contents 
of A 2 , for example, was not altered in the EMULIN 
subroutine, because of its use in the calling fetch routine. 
A 2 ana H were manipulated as either single or double- word 
operands during the arithmetic operations. 

The only remaining logic unit register was the 
memory information register (MIR). MIR was used for storing 
information into memory and as a temporary storage location. 
Intermediate results were deposited in MIR durinq instruc- 
tion execution and returned tnrough the B register. 

The base registers, BR1 ana BR2, were usea as 
storage for addresses and sinqle-lenqth operands, and for 



temporary storage of intermediate results. In addition, all 
memory addressinq in the emulation was accomplished us i nq 
the 1 o^er 8 bits of B R 2 and MAR ( M A R 2 ) . These memory con- 
trol unit registers had to be used carefully, because they 
required a sequence of several microinstructions to proper I y 
reference their contents. 

B. LOADER 

The loader incorporated into the AN/UYK-20 emulation 
provided a simple mechanism for loading AN/UYK-20 instruc- 
tions into main memory (S -memory) of the Burrouqhs micropro- 
gramming system (Figure V-4). Its control word repertoire 
was flexible, allow ina a variety of AN/UYK-20 program 
environments. Job control statements were included to exe- 
cute and halt individual programs anywhere in S-memory. 

The loader module consisted essentially of a scanner and 
a translator written in the microcode. Information was read 
into a 20 -word buffer from cards or CRT input, then the 
buffer contents were scanned for control code consisting of 
one or more characters. Once these characters were inter- 
preted, control was passed to the translator section which 
decoded the rest of the data in the buffer and performed the 
reauired ^unct i on. The translator section consisted of a 
variety of routines t h a t handled specific control words in 
the loader repertoire. The loader control statements, how- 
ever, had to appear in a looical sequence (See Appendix A). 
All loader control statements are contained in Appendix B. 
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AN/UYK-20 LOADER 




Figure V - ^ 
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The actual interface p i o e 1 i n e between the loader and the 
emulator consisted of two small emulator utility routines 



which started and stoooed 


the 


external 


clock of the 


Bu r - 


roughs 


microorooramming 


system. 


T hese rout i nes 


were 


i nse r t ed 


for emulation timing 


du rposes , 


and provided a 


'stop 



watch' for AN/UYK-20 orograms. The time recorded (in mil- 
liseconds) by this ' stoo watch' was placed in a reserved 
memory location in the emulation memory map/ and could be 
read using either a trace control instruction ('T')/ or a 
machine status control word ( ’ M ' ) . 

C. THE FETCH MODULE 

In order to emulate AN/UYK-20 execution and memory 
referencing^ fetch microcode was developed which incor- 
porated memory aadressina algorithms and instruction fetch 
routines. Since both data and instructions were equally 
accessible from the processor, the memory addressing scheme 
was closely 1in<ed to the instruction fetch concept. Data 
ana instructions coulc oe interspersed throughout memory, 
and prooer proaram execution required that the program 
address register point to an instruction word. 



The 

EMULIN 

V -o ) . 
point 
status 
o rde r 



emulation used two routines for memory addressing, 
for reading, and E M UL0UT for writing (Figures V- 5, 
These subroutines performed both paging and break- 
checking, depending on toggles set in the program 
word. Paaina was incorporated into the emulation in 
to gaune the execution overhead required in emulating 
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the AN/UYK-20's oaainq scheme. The paging scheme imple- 
mented in the emulator divided main memory into 256-word 
pages, instead of 1024-word pages used bv the AN/UYK-20. 
Since the Burroughs D-machine was organized for 256-word 
pages, the microprogramming required for the paging was 
straightforward. The 256 page address registers resided in 
the emulator's memory maooina, each initialized to the page 
number cor resoondi nq to their relative address (0-255). The 
pacing algorithm imitated the AN/UYK-20's method of page 
addressing, and included the setting of a page modification 
bit. 

The breakpoint option was added to orovide a method of 
debugging AN/UYK-20 programs, once the emulation was com- 
pleted. t ' A U L I N and E V U L 0 U T tested toggles set in the PSlrt to 
determine if breakpoint read, write, or both was desired. 

The memory addressing convention required all memory 
references from IK to 64K to use EMU LIN and LMULOUT. All 
memory referen ces to the memory mapping area (0-1023) did 
not use these routines, but instead utilized absolute memory 
referencing microcode. 



7ft 



AN/UrK-20 £ M u L IN 




F i aure V -5 
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AN/U'rh-S^ LMULOU 7 




F i qu re V -6 
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The instruction fetch 



rout i ne 



(IFETCH) 



re t r i e ved 



instructions from main memory based on the contents of the 
program address register. Since IFETCH used E^ULIN to read 
the instructions from memory, it could operate in a paging 
environment, with or without breakpoints (Figure V-7). 
IFETCH was also responsible for incrementing the PAH, and 
for test ina the trace toggle prior to fetching an instruc- 
tion. IFETCH was capable of retrieving any instruction word 
in the program address soace (1024-65,535). In the event 
that the uooer memory limit was reached, IFETCH would set 
the PAH to 1024 and continue execution. Trace toggle test- 
ing was inserted into IFETCH as part of the built-in debug- 
ging package. Since IFETCH was called prior to every 
instruction, it was the logical choice for placing a call to 
the debugge r . 
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AN/uyk^O I F £.7 c H 




F i qijre V - 7 



0. OPCODE IMPLEMENTATION 



All of the £05 individual instructions emulated were 
microprogrammed usinc an identical instruction decoding 
mechanism. The routines that Derformed the reauired opera- 
tions were terminal nodes on a large tree network, whose 
root was the OPCODE routine. OPCODE fetched an instruction 
and isolated the operation code field. The binary value of 
the oocode fiela was an index to an operation jump table 
containing individual opcode M-rn e n 0 py addresses. Vi thin 
each oocode routine »as microcode which isolated the sub- 
function 1 f ' field of the instruction and used its value as 
an index to the next level ot opcode analysis. Depending on 
the oocode, this last jump could identify which instruction 
*as to be performed. If it did not, further analysis of the 
' m ' or 'a' fields provided the final index to the instruc- 
tion. Instruction formats are described in Apoendix C. 



After the instruction had been executed, control passed 
b a c < to the opcode routine, and then the execution cycle was 
repeated for the emulation of the next AN/UYK-£0 instruc- 
tion. The AN/UYK-20 programmer could cause this micropro- 
gram loop to be exited by either inserting an executive 
return instruction (03,0, a, 00) which caused a 'priority 
interrupt' and halted program execution, or by coding an 
instruction t n a t was not implemented, not assigned, or 
caused a division overflow. The last three cases caused an 
execution fault, while the former resulted in normal program 
termination (Fiaure V-F), 
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AN/lJYK-ro EMULAIIUN GENERAL S1RUCIURE 




F i aure V -8 



8a 



The instruction fetch mode of OPCODE assumed that the 
PAR always oointed to an instruction word. Double-word 
instructions always had their 'y' field fetched after they 
were determined to be two words in length. Conditional 
double-word instructions performed their condition test 
oefore the *y* field was fetched. If the test failed, the 
PAR was incremented by two prior to returning to OPCODE for 
continued execution. 

The OPCODE routine was also written to accommodate the 
AN/UYK-PO 'remote execute' instruction. when this operation 
was performed, a bit was set in the P S W which would indicate 
that one instruction out-of-line was being executed. For 
this operation, the current P S d was stored in memory, and 
the PAP was loaded'with the address of the instruction to be 
executed. The OPCODE routine always checked the remote exe- 
cute bit during an instruction cycle. If the bit was set, 
it would fetch the instruction indicated by the remote exe- 
cute PAR, and restore the actual PAP, incremented by two, 
into the PS/i. 

Some of the opcode repertoire of the AN/UYK-PQ was not 
emulated. Those instructions that were not emulated, how- 
ever, retained slots in the opcode hierarchy for future 
inclusion. Any instruction not implemented by the emulator 
caused the machine to fault, and printed an error diagnostic 
on the selected cutout device ('MOT IMPLEMENTED - EXECUTION 
ENDS '). Similarly, locations were reserved for those 
instructions not assigned bv the AN/IJYK-PO were reserved 
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locations in the emulation . This permitted the emulator to 
oe responsive to any future AN/UYK-<?0 hardware modifica- 
tions. Whenever an instruction to be executed was not 
assigned/ the emulator Generated a fault interrupt and 
printed an error diagnostic on the selected output device 
(’FAULT INTERRUPT - EXECUTION ENDS’). 

E. UTILITIES 

Although each emulated instruction performed different 
operations/ each depended upon a common set of utility 
suoroutines to accomplish their task. These subroutines 
varied in complexity/ but each performed a function that 
contributed to successful instruction execution. A simple 
operation often required register addressing/ condition code 
setting/ and memory addressing/ oefore the task was com- 
pleted. 

Utility subroutines were included in the emulation when- 
ever feasible to simplify m i c roo roa r amm i no and to alleviate 
programming redundancy. These subroutines were called using 
the successor command constructs available in TRAN SLANG. 
Depending on the purpose of the utility routine/ parameters 
were passed via D-^achine register(s) or condition bit(s). 
This information was utilized by the utility in determining 
what operation was to be performed. In the carry 
subroutine/ for example/ a local condition bit was passed 
whicn indicated the appropriate condition code to be 
inserted into the PS. a j. A more complex example/ the RX 



86 



format utility routine; required two parameters to be set by 
the instruction. Register A3 contained the instruction word 
and L C 2 was set or cleared depending on whether or not byte 
formatting was required. The Px routine called other 
routines and could perform consioerable processing before 
the final result; the effective operand address; was 
returned to the calling routine via the 8 reqister. 

Tne utilitv section encomoassed a number of routines 
which performed complex data manipulation. Arithmetic util- 
ities for multiplication; division, and square root were 
accessed by nine separate AN/UYK-20 instructions. Indirec- 
tion routines, whicn were called by the PX format routines, 
emulated the AN/UYK-20 cascaded addressing c aoab i 1 i t i es . A 
general purpose move subroutine permitted up to 256 cells of 
main memory to be moved from one location to another. This 
routine was used by the emulator's error diaqnostic utili- 
ties, as well as the load and store multiple address regis- 
ter instructions. 

F. INPUT /OUTPUT CONTROLLER 

A 1 t h ouch the AN/UYK-20's IOC was not emulated, several 
of its design features were imitated in creating a general 
purpose i nout /output controller for the emulator. The 
incut/output command instruction (35 PR) initiated the I/O 
sequence, and reserved several cells in the memory mapping 
for I/O control words (Fiaure V - 9 ) . These control words 
contained fields wrier indicated what peripheral device was 
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AN/UYK-20 EMULATION 



INPUT/OUTPUT COMMAND WORDS 



IOCW (50) 



1514 13 12 II 10 9 8 7 6 5 4 



3 2 10 



INPUT ( 0 ) / OUTPUT ( 1 ) 

NOT USED 

BUFFER COUNT 



,.P-SVI_CE-ID. «. 



BUFFER ADDRESS 

THE MICROCODE ALTERS THE IOCW + 1 
DURING BUFFER TRANSFER 



2 i 1 1 1 m m ii i i 1 1 n i 



BUFFER SIZE 
USED FOR CRT OUTPUT 



TT 



XT 



NUMBER OF SECTIONS 

USED FOR DISK TRANSFERS (NOT IMPLEMENTED) 
* CODE: 00 - CRT 

01 - PRT (OUTPUT) /CRD RDR (INPUT) 

10 - DISK (NOT IMPLEMENTED) 



F iqure V -9 
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selected, the number of buffers that would be passed, 
whether inout or cutout was desired, and where the buffer 
was located. Two additional words were reserved for control 
data reauired to interface with the disk and the CRT. The 
disk inout/outout m i c rocode was not incorporated into the 
controller, but the framework was provided so the IOC could 
be readily inserted in the future. 

The emulator's IOC section was located in the utility 
section of the emulation, and operated asynchronously with 
the Burroughs IOP. Since the IOC was collocated with the 
emulator in the same O-machine, it was not capable of 
independent orocessino. Once it was initiated, emulation 
execution waited until input/outout was completed. The emu- 
lation nad to use the Burrouans Common Language, since the 
IOP was microproaram^ed to accept only this character set 
from other O-machines. 

In order to transfer a 33-word line printer buffer to 
the IOP, the Ah/UYK-20 emulator IOC had to use 66 Af'j/UYK-PO 
words. In order to send 20 words to the CRT, the IOC had to 
contruct a 40- word Puffer. The CO^PR&S subroutine packed an 
AN/UVK-20 buffer of 66, lo-bit words into a 3 2 - bit, 33-word 
buffer, suitable for cutout to the IOP. 



8R 



S i m i 1 a r 1 y , on incut* a 20-word IOP buffer from the card 



reader or CPT exoanded into a 40-wora AN/UYK-20 buffer. In 
this case* the EXPAND subroutine split every packed 32-bit 
word of the lOP’s buffer into two 16-bit words* suitable for 
processing by the emulator. These additional data transfor- 
mations were reduirea because of the buffering requirements 
of the Burroughs IOP. 
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VI. EMULATION TESTING 



A. METHOD OF TESTING 

The fundamental test i na technique utilized throughout 
the develooment of the emulation consisted of three distinct 
phases: 1) each module was i ndeoendent 1 y assembled and 
debugged until successful assembly was achieved/ 2 ) each 
program seament was then tested for accuracy of the intended 
operation/ and 3) tne composite emulation program was tested 
under actual program conditions allowing for interaction 
amonq several modules. A debuaging package/ which was 
developed early in the emulation project/ was the primary 
test vehicle for verifyina emulation coding. Development 
and test i na of the loader was/ however/ completed prior to 
the creation of the debugger. 

Loader test i na was accompli shed, by monitoring the con- 
trol oanel lights and synthetically halting program execu- 
tion via the insertion of * to A I T ' statements/ forcing the 
desired register to be transferred to 'MIR'/ and then re- 
examining the canel lights. This process was extremely 
tedious and time consuming but proved to be an invaluable 
tool for recoani zinc peculiar TRAN SLANG constructs and 
microinstruction execution side effects. 
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Each module was subjected to an extensive aesk checking 
process in order to reduce trivial assembly errors before 
runnino it on the machine. After a successful assembly had 
been achieved# the code was merged with previously existing 
assembled modules. The emulator was expanded as more 
modules were added# with utility routines being incorporated 
Drier to the oocoae programs. 

Initially# the loader was designed# programmed# and 
thorouahly tested crior to the microprogramming of any. 
AN/UYK-20 instructions or utilities. i t h the loader imple- 
mented# t h e emulation of the UYK-20 instruction repertoire 
coulc orocede with a minimum of loader induced problems. 

Independent opcodes and various utility routines were 
added to 'the previously assempled programs. The final emu- 
lation consisted of a total of 205 opcodes# a set of utility 
programs# and a workable loader. 

In the next phase of testing# every module was subjected 
to a representative number of test cases which demonstrated 
how closely they coToaren to the documented AN/UYK-20 opera- 
tion. Artificial environments were created to subject every 
opcode to a variety of situations. Whenever the operation 
of the opcode disagreed with the documented AN/UYK-20 opera- 
tion# the opcode's function was thoroughly researched. The 
opcode's side effects were categorized and questions formu- 
lated. Often the answers could only be obtained from the 
Uni vac field engineer. 






To illustrate the complexity of test i nq# an ado instruc- 
tion was tested with numerous operand combinations: two 
positive operands/ two negative operands* one of each sign 
yielding a negative result* opposite signs producing a posi- 
tive result* and oooosite sians producing a zero sum. Dur- 
ing each addition operation, the overflow, carry, and condi- 
tion bits had to be monitored in status register 1 to verify 
tneir appropriate set t ioa. This level of detail was 
achieved w i t n all of the implemented opcodes in order to 
produce an efficient and accurate emulation. 

Throughout the entire testinq scenario* which composed 
20% of the emulation project* the debugger routine (DUMPREG) 
provided the necessary information for examining opcode exe- 
cution. A represen t at i ve sample debugger output is provided 
in Appendix D. 

DUMPREG Permitted a snapshot of both AN/UYK-20 general 
register stac<s* PS'A* SRI* and S R 2 , in addition to the D - 
machine registers ( A 1 , A 2 , A3, 6 P 1 * A M P C R * MIR) and the Bur- 
roughs external c 1 o c < . DUMPREG possessed sufficient flexi- 
bility to be incorporated into the microcode at any point. 
In the final emulator, however, the debugger is user speci- 
fied, and will dump the AN/UYK-20 emulator environment 
either at every instruction fetch and program stop, or when 
called by a loader control card. 



93 



B. SAMPLE TEST PROGRAMS 



After subjecting the entire emulation to extensive test- 
ing# some representative test orograms were developed to 
demonstrate the feasibility of the emulation and its capa- 
bilities and performance as compared to an actual AN/UYK-20. 
There were two programs which were selected because they 
incorporated numerous emulation features. The two Programs 
were the solution of simultaneous linear eauat ions by 
Cramer's rule ana generation of prime numoers. It must be 
noted that streamlined program design was not emphasized but 
rather utilization of a variety of opcodes ana features of 
the emu 1 a t i on . 

The program for solving linear eauat ions contained a 
total of 28 opcodes# reauirina 28 opcode execution cycles 
and 43 instruction fetches. The orogram demonstrated all 
four fundamental mathematical operations# numerous store and 
load functions# a comparison test ana a jump instruction. 
The capability of performing card reader inout and output 
was added when the emulator IOC was completed. 

The pr’ime number program demonstrated 30 instructions 
which reguireo 116 opcode execution cycles# and 122 instruc- 
tion fetches. This program illustrated numerous comparison 
tests# loooina structures# several jump instructions# a load 
multiple instruction# addition# and division. The test pro- 
grams are i nc 1 uded in Apoengix E. 
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C. TEST RESULTS 



The performance analysis of the test programs consisted 
primarily of running numerous test cases/ examining the 
results for accuracy and computing the total time required 
to execute the emulated AN/UYK-20 programs. The timing of 
the programs was accomplished by using an external clock 
available on the TCP. The clock time provided a fairly 
close reoresentat ion of the emulation execution timer but it 
cannot be considered a completely accurate measure since the 
emulation must interrocate the IOP to retrieve the external 
clock contents. Apo r o x i ma t e 1 y 50 microseconds are used when 
sampling the clock. 



The time recui regents of the AN/UYK-20 program execution 
were h ana-c a 1 c u 1 a t ed by summing the published instruction 
execution ti^es as presented in Ref. 21. The emulation per- 
formance ratio (EPR) was computed merely to give an approxi- 
mate indication of the emulation performance. The EPR is 
the ratio of measured 0ur rouahs emulation time to the calcu- 
lated AN/UYK-20 execution time. Two EPR figures are 
recoraea : one with oagina implemented and one without. The 
paging EPR figure is siani f icant only in that it indicates 
how much additional overhead must be incurred when emulating 
the Paging mechanism of the AN/UYK-20. Tne required addi- 
tional execution time was about 2 b % . Naturally/ paging 
overhead is directly proportional to the number of instruc- 
tion fetches or memory references performed during a pro- 
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The results of program testing are as follows: 

\ 

CRAMER'S PULE PRIME NUMBERS 



Number of Cocodes 


28 




30 




Number of Fetches 


4 3 




122 




Execution Cycles 


28 




116 




Time w/o oaninq 


5000 


usee 


1 3000 


usee 


Time with Paging 


8000 


usee 


1 7000 


usee 


AN/UYK-20 Time 


78 


usee 


193 


usee 


EPR w/o Paging 


6 4 


1 


67 : : 


1 


EPR with Paging 


103 : : 


1 


88 : : 


1 



These figures represent only approximate comparisons of 
the two machines. These computations provide an estimate of 
emulation characteristics. An effective EPR without paging 
was projected to be 65::1. 
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VII 



SIJVMARY AND RECOMMENDATIONS 



A. EXPERIENCE vjITH HARD/.ARE 

The emulation Droject provided the unique opportunity of 
learning aoout two computer systems. The burroughs D - 
machine demanded a detailed knowledge of hardware ooeration, 
as well as a thorough unoerstandi ng of the microprogramming 
language, TRANSLANG. The computer architecture and proces- 
sor cacao i lit i e s of the AN/UYK-20 had to be investigated and 
then integrates into the control store memory of the 
Bur rough ' s D-machine. 

On several occasions, haroware malfunctions with the 
Burroughs eguioment crevented normal system operation. The 
card reader was inooerable for several weeks, and the disk 



drive 


unit 


had to 


be reoaired several times. 


During the 


final 


three 


weeks of 


project 


development, one 


D-machine 


ceased 


t 0 


funct ion 


proper 1 y . 


This restricted 


e m u 1 at i on 



testinq to. the remaining D-machine. 

Although these hardware difficulties impeded normal pro- 
gress of the project, they did not prevent the emulation 
from successfully being completed. If a hardware problem 
prevented emulation testing or debuaaing, other modules were 
designed and test i nn was postponed. This permitted con- 
tinual emulation development regardless of the haroware 
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status. In addition, considerable time and effort was 
invested in t roub 1 e*shoot i nq Hardware malfunctions/ so they 
could be isolated/ diagnosed/ and recaired. 

B. LESSONS LEARNED 

The emulation project brouqht together many different 
computer science techniques and disciplines which will be 
useful in future computer science endeavors. A great deal 
of experience in both computer architecture and operation 
was gained in two different types of computers. Micropro- 
gramming orovideo new insight into computer desi an and 
revealed many potential applications for programmable con- 
trol store machines. 

Since emulations normally require a large development 
effort/ this project had to incorporate judicious system 
design and project management principles in order for it to 
be completed successfully. Careful monitoring of critical 
staqes of the emulation, ana coordination of the programming 
effort to meet scneduled reauirements/ was necessary 
throughout this research. The small programming team con- 
cept proved to work extremely well in this project. 

Finally/ several software practices were strictly fol- 
lowed that proved tc be invaluable in constructing the emu- 
lation. First/ modular oroaramminq succeeded in partition- 
ing the emulation into discrete modules whicn could be 
designed/ cooed/ tested/ and implementea individually. The 
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emulation was constructed in segments, using the previously 
verified modules as a test bed for the new modules beinq 
built. Structured programming and prolific documentation 
were useful in developing each microorogram routine because 
they permitted each team member to understand the function 
of the program and how it could be used. 

C. emulation PROBLEMS 

The most significant problem of the emulation was not 
having had any experience with an AIWUYK-20 and not havinq 
anyone readily available with prior AN/UYK-20 experience. 
This lack of knowledge created some anxiety when attempting 
to analyze the ramifications of individual opcodes. 

The idiosyncrasies of certain opcodes were not made 
self-evident oy the AN/UYK-20 software manuals. Conse- 
quently, numerous code modules were redesigned when more 
detailed information was provided by a UNIVAC field 
engineer. This proved to be very time consuming, ineffi- 
cient, ana frustrating. 

Another emulation difficulty was the lack of working 
registers in the host machine as compared with the tarqet 
machine. Pi h i 1 e tne AN/UY'K-20 has either lb or 32 general 
registers, depending on whether the second stack option is 
incorporated, the 0- machine has effectively only seven work- 
able registers containina lb bits or more. Therefore, all 
AN/UYK-20 registers had to be mapped into S-memory which 
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createa much longer register read/write times. This created 
significant reaister manipulation problems which in some 
cases reguired main memory references for instructions 
intended to be strictly regi ster-to-regi ster operations. 
Consequent 1 y? increased execution times resulted in a higher 
emulation performance ratio ( E P R ) / decreasing the overall 
emulation performance. 



D. RESULTS 



Emu latino the AN/UYK-PO on a Burroughs D-machine 
required a consideraole amount of preoaration and planning 
before anv results were realized. An in oepth analysis of 
each computer's architecture ana operating characteristics 
was conducted to insure that an emulation was feasible in 
the allotted time period. From the inception of the pro- 
ject/ the goal of tne emulation was to implement a standard 
AN/UYK-PO processor. This decision was basea on the capa- 
oility of the D-macnine ana an estimate of how much time 
would be involved in developing/ debugging/ and testing the 
final product. Although this goal was achieved/ it was felt 
more time could have been devoted to test inq and verifying 
emulation ooerat ion. 

The AN/UYK-PO emulation was a highly complex micropro- 
gramming project involving numerous aata structures and 
transfer protocols. A total of 205 AN/UYK-PO instructions 
were emulated out of nearly PRO instructions in the reper- 
toire (including all IOC/ math oac/ and clock interrupt 
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opcodes). A.N/UYK-20 aiaonostic programs and user programs 
will establish the validity of the emulator's construction. 

E. RECOMMENDATIONS and F0L10N-0N TOPICS 

Although 305 oocoaes have been implemented/ tested/ and 
developed into a working emulation/ there are still many 
challenging avenues tc pursue in creating the complete emu- 
lation package. First/ test i na is a continuous process and 
should be performed in conjunction with code optimization. 
A comprehensive code optimization effort could improve the 
E p R without sacrificing code readability. 

Second/ the floating point and 'math oac ' options could 
be implemented. The addition of floating point arithmetic/ 
trigonometric/ and hyperbolic functions would significantly 
strengthen the scientific capabilities of the emulation. 
This would be an extremely strenuous undertaking but would 
permit more tactical data applications of the emulation/ 
increasing its value to the Nevy. 

One area which could be examined is to perform a 

comprehensive timing analysis between an AN/UYK-30 and the 

\ 

emulation. This coulc consist of collecting numerous bench 
mark programs from Navy AM/UYK-20 installations/ where per- 
formance data could be accurately obtained. These programs 
could then be run on the pmulator, and analyzed with the 
Denci mark results. The feasibility of replacing or substi- 
tuting emulation host machines for target machines could be 
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addressed and supported by the timing analysis study. 

An emulation of the AN/UYK-20 input/outDUt controller 
could be incoroorated into the emulation without serious 
difficulty. Since a second 0-machine is available, the IOC 
channel processor instructions could be emulated and 
inserted into t n a t D-rracnine's control memory. The indepen- 
dent processi no characteristic of the AN/UYK-20 IOC would be 
fulfilled usina tnis arrangement. 

Finally, when the Burrouahs system is linked with the 
computer science department's POP 11/50, the AN/UYK-20 emu- 
lator could oe connected to that system's peripheral 
resources. This would allow future incorporation of an 
AN/UYK-20 ULTPA assembler and a C M S - 2 M compiler into the POP 
11/50 system which could then produce machine language 
object coae files for the AN/UYK-20 emulator to execute. 
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APPENDIX A 



AN/UYK-20 EMULATOR USER'S MANUAL 



This aescnot ion is designed to provide sufficient 
information to operate the AN/UYK-20 emulation program. It 
is assumed that the informal Burroughs D -machine manual in 
the Burrouahs laboratory will suooly adequate power-on 
instructions and solve any hardware operating difficulties 
which may arise. 

The procedure for utilizing the AN/UYK-20 emulator can 
be dividea into four chases: 

1) selection of the necessary loader control cards 
( JCL) . 

2 ) selection o * the AN/UYK-20 program instruction set. 

3) implementation of chases one ana two into the 
required card cr CRT format. 

4 ) actual hardware implementation on the Burroughs 
D-machine resultina in program execution. 

Pnase one can oe achieved by selecting the desired 
loader control cards described in Appendix B. The card for- 
mat is identical to the CRT format# except that the CRT 
requires the user to <carriage return> at the end of every 
line of incut data. 

Phase two is accomplished by creating the AN/UYK-2Q 
oroqram from a subset of the 205 emulated instructions. 
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Phase three consists of 
AN/UYK-20 program in the format 

The resulting job a e c k will 
f o 1 lows: 

? S M L 0 A D TED/UYK20 -OBJECT 

L ooooa 

C 
C 

C 002<?2 DIVIDE OVERFLOW 

************************ 
other des i red JCL 
************************ 

A N / U Y K - 20 Proqram 

0 3 , 0 , a , 0 0 

(a = 00-17 octal ) 



* (or Ml) 
E 



the desired JCL and 
illustrated in Appendix C. 

typically be assembled as 

X loads the emulator into 
m i c romemo r y 

% load the program into 
page 4 



FAULT ENTERED" 



X priority interrupt 
(mandat ory card) 

% commence proqram 
execut ion 

X machine status ( reg. dump 
at termination) 

% end job card 



k ey ounc n i nq 



0 0 2 0 o FAULT INTERRUPT-EXECUTION ENDS 

0021 4 NOT r*»PLEMENTED-EXECUTI0N ENDS 
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Finally/ ohase four consists of the entire oroaram deck 
beinq loaded into the card reader. It is assumed that the 
IQP is at address 0015 hex/ the selected interpreter is at 
address 0549 hex/ the line printer is 'READY', and the IRQ 
switch is 'off'. 

The IRQ switch is a three-way toggle switch mounted on 
the right side of the interpreter. In the up position, IRQ 
is 'on', horizontally it is 'off', and the downward position 
is used for external functions. 

If either the interpreter or the IQP is not at the 
proper starting address, clear them py depressing the 
'CLEAR' button on each unit. If this Procedure does not 
remedy the situation, consult the user's manual in the 
laboratory. 

Uoon reading the ? S L 0 A D card, the system will load the 
AN/UYK-20 emulator object program into the micromemory of 
the selected interpreter. The program address counter (on 
the interpreter) will be at the beginning of the emulation 
program, address 0000 hex. At this point, the user can 
select CRT input and output by placing the IRQ Switch to the 
'on' (upward) position. If CRT incut is not desired, leave 
the IRQ switch in the 'off* (horizontal) position. Next/ 
force step the interpreter by momentarily depressing the FST 
button (uppermost push button on side panel of the inter- 
preter). Do not hold in the FST button. This may cause 
so^e undesirable side effects. 
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After aeoressina the F$T button, the AN/UYK-20 program 
is 1 oaoea into S - m e m o r y . If tne input is expected from the 
CRT, the user must enter his program from the console. Oth- 
erwise, the emulator will request cards from the card 
reader. Once the program nas been loaded and the 'O' card 
read, orogram execution begins. 

After successful program termination, the orogram 
returns to the loader and asks for more input. At this 
point, tne user may terminate his job, or start another load 
sequence. It should be remembered that the AN/UYK-20 emula- 
tor has been designed for monoo r og r amm i nq execution. It 
executes one program at a time, but can execute any number 
of programs in seauent i al order if desired. hhen the job is 
terminated, the D-machine returns to its starting address 
(0000 hex) and awaits further processing. rthen the user 
returns to the start address, the system is effectively 
’master cleared' since S-^emorv will be cleared prior to 
executing any further jobs. It is not necessary, however, 
to use the ?$r*'L0AD card for additional programs or program 
re-runs because the emulator object cooe still resides in 
m i c romemo r y . 

If the results of program execution are not as antici- 
pated, use of either a * T ' or * T 1 ' option (trace card) is 
recommended to provide t h e user with a f e t c h -oy - f e t c h pro- 
gram trace with the output to the printer or CRT respec- 



tively 



APPENDIX B. LOADER CONTROL CARD FORMATS 



CARD COLUMNS 



1 2 3 4 5 6 7 8 9 1 0 1 l 12 13 14 15 16 - 20 



a 


R 

B 


decimal 


add r 




c 




decimal 


add r 


character string followed by a " 


D 




decimal 


add r 


data 


E 


G 




decimal 


ado r 




I 


1 

2 




rea» 


da t a 


L 




dec • cage 


n u m 




M 

1 


P 


P 




dec. numoer 




S 


1 

2 








T 


1 









Note: All numeric fields are in decimal. 
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LOADER CONTROL CARD DESCRIPTION 



Control C a r d 

Identitifier Description 



B 



The '6' cara is used to implement the 
breakpoint feature of the emulation. 
This features allows the user to specify 
address in columns 4-8. 
contain a R or W to break- 
read or write operation 
The default condition is 
both read and write. 



a decimal 
Column 3 must 
point on a 
r espec t i ve 1 y . 
brea<coi nt on 



C The ' C 1 card is used to insert character 

st rinqs into memory. The character 
string starts in column 9 and continues 
until terminated by a Quote symbol ("5 
or the string reaches column 80. The 
decimal address where the character 
string will be written is given in 
columns 4-8. A blank or zero address 
field causes the character string to be 
inserted at the current load address. 



D The 'D* card is used to store decimal 

data from columns 16-30 into the memory 
address found in columns 4-8. A blank 
or zero address field causes the data to 
be inserted at 'the currrent load 
address . 



E The 1 F ' card is used to indicate the 

end-of-job and therefore is a mandatory 
control card for job seoaration. 



G The * G • card or 'GO' card is used to 

start execution. The starting decimal 
address is in columns 4-8. The default 
value of a blank or zero address field 
will cause oroaram execution to start 
ado res s 01034. 
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The 'I' card or set index register card 
is used to store decimal data from 
columns 18-20 into the register desig- 
nated in columns 7-8/ into the general 
register stack (1 or 2) specified in 
column 2 . 



The ' L * or load card is used to Parti- 
tion memory into 256, 2be>-word (32-bit) 
oaaes and to load the program into the 
decimal cage as referenced by columns 
4-8. Pages 0-3 should not be used 
because they contain the reguired emula- 
tion register mapping and established 
workspace. The * L * is a reouired con- 
trol card, and it is recommended that it 
be the first JCL card. 



The ' M ' card or machine status card pro- 
vides a register dump/ whereever it is 
insertea. It is normally placed between 
the * G * and * E * cards in the JCL deck. 
If a 1 is olaced in column 2 / the 
machine status dump will be sent to the 
CH7. This dump will contain the current 
value all AN/UYK-20 general registers/ 
the p S *' / S P 2 / next instruction address/ 
the breakpoint adaress/ and clocktime. 



The ' P* card is used to implement pag- 
ing. 

The *R' or reserve space card is used to 
reserve memory space as specified 
necimallv in columns 4-8. 



The ’S' card is used to simulate setting 
of the program stop switches (1 and 2 ) 
on the AN/UYK-20 maintenance panel. 
Column 2 must contain a 1 or a 2 , Two 
cards are reouired if both switches are 
to be set. 



The * T 1 or trace card provides a fetch- 
cy-fetch program trace/ dumping the 
entire machine status on every fetch 
cycle. If a 1 is in column 2 , the trace 
will be displayed on the C P T . This card 
is recommended for oebugging programs. 
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APPEND I X C 



AN/UYK -20 EMULATOR INSTRUCTION FORMAT 



Format 

****★★★* 

RR, RL, 

R I Type 

R I Type 

RK , RX 
Notes: 

1 ) 

2) 

3 ) 



Card Columns 

a********-*************************************** 



567891011 121314 

opcooe t 1 f * / 'a' , ' m ' 



5 6 7 6 o 10 11 12 

l opcode / ' f ' t ' d ' 

5 b 7 3 9 10 11 12 13 19 IS 16-20 

oococe / ' f 1 , 'a'/ ' m ' , ' y ' 



All fields are in octal with the exceptions of 
addresses and the 'v' field which are decimal 
All fields must be zero-filled 

The followinc field restrictions must be followed: 



Field 


Range 


Base 


Opcode 


00-76 


octal 


’ f ' 


0-3 


octal 


' a ’ 


00-17 


octal 


' m ' 


00-1 7 


octal 


'O' 


000-377 


octal 


' y ’ 


00000-65535 


decimal 


add r 


00000-65535 


dec i rra 1 



APPENDIX D. SAMPLE DEBUGGER OUTPUT 



This appendix provides a sample program output illus- 
trating the Trace option. The debugger is called at the 
beginning of every instruction fetch/ and at the termination 
of the program. Space has oeen provided for dumping 48 
memory addresses including emulated AN/UYK-20 registers and 
D-macnine registers. Each output line consists of eight 
regions printed in hexadecimal with a total of eight hex 
digits (33 bits) per region. 

The listing is annotated to indicate the identity of 
each output cell. A summary of cell definitions (numbering 
from left to riaht/ too to bottom) follows: 



decimal address description 

******************f*************-***************** 



0 


- 15 


General Register Stack 1 


1 6 


- 31 


General Register Stack 3 




33 


Proaram Status flord (PSIrt) 




33 


Breakpoint Reqister (BREAKPT) 




34 


Status Reqister 3 (SR3) 




35 


NEXT INSTR (next load address) 




3o 


C 1 oc k t i me 




37 


Real-Time Clock (RTC) 




38 


AMPCP 




3R 


A 1 




40 


A 3 




41 


A3 




43 


MIR 




43 


OR l 


uu 


- 4 7 


Unused 



ANO STORE RFSULI IN REG 



o © © © © o 

© © c © © o 

© O © © fs. © 

© o © © w © 

o O C C © © 



C*o £ J o' £ c 



O O O © LL o 
© © c © *■* © 
O O © C © © 



o © O © © O 

c o o c o © 

© O O © LL © 

© o o © o c 

© © 00^0 



© © © © © © 

o o c o u. g 

© © c © © © 

c © © © «-» © 



o © © © *o c 
u g g g ^ C 
© © © © © © 
© © © © © © 
© © © © © © 
© © © © © © 



©coo©© 
© c© © © © 
© o o c © o 



© © © © © © 

© © © © © © 

© © © © © © 

© © © © © c 

© © © © © © 



c o © © © o 
©©©©©© 
© © © © © © 



©©coo© 
© o o © © © 
© © © © o © 
~ o © © © © © 

>- © © © © © o 



© © © © © © 
o © © o © © 
© © © © © © 
© © © © © © 
© © © © o © 
© © © © c © 
o © © © c c 
© © © c © © 



© © © © © © 
© o © © © o 
© © © © © © 
© ©CO©© 
© © © © © © 



© © © © © © 



© © © © © r. 
©coo©© 
© © © © o © 

C CO o o o 

© © © © © © 



z> — 

Q © *- 
5 © < Z 

i oa. « 



— ©©o©o©c 

©COOO© 

<©0*-©©©©©© 

oarz>z>coo©o© 

LU *-0©00©0© 

1-crxggoggc 



© © © o © © 
c © © © o © 
© © © © © © 
©©©©CO 

© o © © © o 
o © © © © o 



o o © o © © 
© © © © © © 
© © © o © © 



© © © © © c 
o © o © © c 



) U O O O © K) 
3C©n©©o© 
j L o g© o ?g 
f o © © © © © c 

JOOOOOO© 

0 © © © © © 
© © o © © © 
© 0 © © © © 



© © © © k> ca 
© © © © © © 
© © © © 3* C 



© © © © © © 
© © o © © © 
© © © © © c 



© © CJ © m eg 
O O O © O o 
© © C © 3* © 
© © o © © © 
o © © o © 0 
© © © © © © 
© © © © © © 
c © 0 © © © 



© 0 © © K> eg 
©00 ©or* 

© © © O ST © 
© © © © © © 
© 0 © © © © 
o © © © © © 
C- © © © © © 

o © © © 0 c 






D © O ^ 



> © © c 
i u o c 



> © © c 

-» © © c 

5 © 0 c 

> © © c 



© © © 

in ^ rv © o © 









© © © c ? a> 

©00©©© 

© © O © © o 



o © n © o 

© © © C =X VC 

C-OCO©© 
© © © © © © 

© © © © o 

© © © © © © 

© © © © © c 



o 0 © © © m 

LL ©00©© 
© © © © © © 
© © © © © c 
0 ©00©© 




© © © o < 
©CO©: 
© © © © < 



o c c © 1 



© © © © o 
© © © © o 
© © © © © 
©00©© 



in © © © ** © 
r c> r o ft r*. 

© o © © © © 



sr © © © © © 
000 000 
3 * © © © V © 
© © © © © © 
© © © © © © 
© © © © rs © 
© © © © © © 
© © © © c © 
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APPENDIX E . SAMPLE TEST PROGRAMS 



The programs listed in this appendix were designed to 
illustrate how t^e AN/UYK-20 emulator can be used for 
develooina programs. The first two programs presented are 
the generation of prime numbers and the solution of simul- 
taneous linear eauat ions by Cramer's Rule. They depict 
several AN/UYK-20 control structures such as looping, itera- 
tion, and condition code testing as well as numerous load, 
store, and arithmetic operations. Tne last program performs 
the Cramer's Rule algorithm and demonstrates input/output 
routines. 
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•LINEAR EQUATIONS 8T CRAHERiS RULE 
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ENTR* POINT FOR REPETITIVE 1XECUT10K" 
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THE INPUT COEFFICIENT VALUES AND CONSTANTS FOR EACH ECUATION MUST BE 1NSERTFP 
USING LOAOER CONTROL CAROS IN T Hf FOLLOWING LOCATIONS 
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APPENDIX F. EMULATOR LISTING 



This appendix provides a listing of the TRANSLANG 
assembler output of the AN/UYK-20 emulation. A source file 
copy of the emulator exists on disk as well as on cards. A 
microinstruction object file is also maintained on disk. 

The listing is divided into four sections. The left 
most section contains the microinstruction address composed 
of four hexadecimal diqits followed bv a 56-bit microin- 
struction created from a TRANSLANG instruction. The center 
section contains the TRANSLANG source which includes labels/ 
an instruction/ and/or a comment field. The final number 
printed on the riqht most side of the listing is the 
seauence number of the TRANSLANG source statement. This 
numoer/ printed in c e c i m a 1 / is created by a Burroughs 
software utility program called CARD-LIST. This number must 
be used when editing source programs on disk. 
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MAIN MEMORY AODRESS MAPPI 
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THIS ROUTINE WILL READ COLUMNS 1-4 AK'O 
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RX TYPE COMPARE MASKED 
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